Unit 10 


Sequences and series 


Introduction 


Introduction 


This unit introduces the concept of a sequence, which is the mathematical 
name for a list of numbers arranged in a particular order. The list can stop 
at a particular number, or it can contain infinitely many numbers. Here 
are some of the sequences discussed in the unit: 


Oe a Tia 
7.15, 10.50, 13.85, 17.20, ..., 245.00 
1000, 950, 900, 850, ..., 0 


1000, 1050, 1102.50, 1157.63, 1215.51 
2000, 1400, 980, 686, 480.2, ... 
1, 2, 6, 24, 120, ... 
1 1 1 1 
4’ 8 16’ 32’ °° 
1, 7, 21, 35, 35, 21, 7, 1. 
Here the three dots ... stand for ‘and so on’, to indicate that the list 


continues. If the dots are followed by a number, then that number 
terminates the sequence. Otherwise, the sequence continues indefinitely. 


Several of the sequences above arise from practical situations, such as 
estimating the height of a bouncing ball on successive bounces. Each of 
them has a definite mathematical pattern, and you may like to try to spot 
the pattern now in each case. (However, do not spend longer than a few 
minutes on this. The patterns may become harder to spot as you move 
down the list.) You'll see later what pattern is associated with each 
sequence. 


In Section 1 you’ll meet two different ways, known as closed forms and 
recurrence systems, to specify sequences using formulas. A closed form is a 
formula for each term (individual number) in a sequence, whereas a 
recurrence system indicates what the first term is and how each subsequent 
term is related to the previous term in the sequence. 


In Section 2 you'll study two special types of sequence, known as 
arithmetic and geometric sequences, which include the majority of the 
examples above. 


Section 3 explores how sequences can be visualised, and how they behave 
in the long term, that is, when a large number of terms are considered. 


In some cases, it’s possible to calculate the sum of the terms of a sequence, 
even if there are infinitely many of them. An expression that’s obtained by 
adding the terms of a sequence, such as 1 +3 +5+ 7+9, is known as a 
series. Series are the subject of Section 4. 


In Section 5 the focus moves from sequences to the binomial theorem. This 
important result enables you to multiply out expansions such as (1 + 2)4, 
(a +b)" and (2y — 3)° quickly. 
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1 What is a sequence? 


In this section you'll meet some notation used for sequences, and two 
different ways to specify sequences using formulas. 


1.1 Sequence notation 


We frequently see lists of numbers arranged in order. For example, a list of 
numbers can be used to represent a quantity that’s varying over time, such 
as the midday temperature (in °C) at a particular location, recorded each 
day for a week: 


13, 12, 10, 10, 10, 9, 8; 


or the amount of money (in £) in a savings account on each 1 January 
over a five-year period: 


1000.00, 1050.00, 1102.50, 1157.63, 1215.51. 


In mathematics, a list of numbers is called a sequence, and each number 
in the list is called a term of the sequence. 


A sequence that has a finite number of terms (and hence has a last term as 
well as a first term) is called a finite sequence, whereas a sequence that 
has an infinite number of terms is called an infinite sequence. 


For example, 
13, 12, 10, 10, 10, 9, 8 


is a finite sequence, whose first term is 13 and whose last term is 8. On the 
other hand, the sequence of odd natural numbers, 


1, 3, 5, 7, Dy aes 


goes on forever, with first term 1 but no last term, so this is an infinite 
sequence. For each term in the sequence there is always a next term, an 
odd number 2 greater than the one before, so the sequence has no end. 


Many sequences have a structure, or pattern, that allows us to give a 
concise description of the sequence and helps us to understand its 
behaviour. For example, the sequence 1,3,5,7,9,... has a simple 
mathematical structure; each term is exactly 2 more than the previous 
term: 


14+2=3, 34+2=5, 54+2=7, 7+2=9, 


The savings account sequence 1000, 1050, ..., 1215.51 also has a simple 
structure, though this is less obvious. For this sequence, you can check 
that, to three significant figures: 


1050 _ 1102.50 _ 1157.63 _ 1215.51 _ 
1000 "1050? 1102.50? 1157.63 
so each term is 1.05 times the previous term (approximately, at least). 


1.05, 


This pattern arises because the savings account pays compound interest of 
5% per annum, and there have been no withdrawals. We consider this 
sequence again in Section 2. 


The last two fractions above are not exactly equal to 1.05 because the 
numbers in the sequence have been rounded to two decimal places, that is, 
to the nearest penny. Without this rounding the sequence would be 


1000.00, 1050.00, 1102.50, 1157.625, 1215.506 25, 
for which each term is exactly 1.05 times the previous term. 


There’s no reason to suppose that the temperature sequence above has any 
simple mathematical structure, since it is obtained by making independent 
measurements each day. In this unit, we’ll investigate various types of 
sequence that do have simple underlying mathematical structures. Such 
sequences arise in various real-world situations and have many applications. 


First, however, here’s some notation associated with sequences. In 
elementary algebra, we use letters such as a,b,c, x,y,z, A,B,C, and so on, 
to represent variables. With a sequence, we represent the terms by using 
one particular letter with an attached subscript; this subscript is an integer 
that indicates which term of the sequence is referred to. Thus the sequence 
@1, 42, 43, ---, Q10 
has 10 terms, the first being a; and the last being aio. This notation is 
called subscript notation (or suffix notation). The terms above are 
read as: a-one (or a-sub-one), a-two (or a-sub-two), and so on. (You’ve 
seen subscripts used previously, for coordinates in Unit 2, and for vector 
components in Unit 5.) 


Sometimes it is possible to choose an appropriate letter for a sequence. For 
example, you might use t for the temperature sequence. Since the first 
term is 13, you write tı = 13, and so on, giving 

tj) = 13, tg=12, t3=10, t4=10, t =10, te =9, t7 =8. 
You can use either upper- or lower-case letters to represent sequences. The 
use of appropriate letters can be helpful, especially when dealing with 
several sequences, but it is not necessary and indeed not always possible. 


If you want to refer to a general term of a sequence, rather than a 
particular term, then you use a letter for the subscript as well, as follows: 


an denotes the term of the sequence with subscript n. 


Here n represents a natural number in the appropriate range; for example, 


for the sequence a1, d2,...,@19, the range of n is 1,2,...,10. In 
mathematics, the letter n often represents a natural number or, more 
generally, an integer; that is, one of the numbers ...,—2,—1,0,1,2,.... 


Other letters commonly used to represent integers are i, j, k, l, m, p and q. 


The notation a, denotes an individual term of a sequence — the term with 
subscript n. You can represent a whole sequence by using notation such as 
the following: 


(ay,)+_, denotes the finite sequence a1, a2, a3, ..., Q17 


(dn); denotes the infinite sequence a1, a2, a3, .... 
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Because infinite sequences occur frequently, we use the abbreviated 
notation (an), with no subscript values, to mean (a,,)°°,. That is, 


(an) denotes the infinite sequence a1, a2, a3, .... 


The variable n in the notation for a sequence is sometimes referred to as 
the index variable for the sequence. It’s a dummy variable: you can 
change it to any other variable name you like without changing the 
meaning. For example, 


(Gn )n=1> (ai)? and (ax) e241 


all denote the same sequence a1, @2,a3,.... (The idea of a dummy variable 
featured also in Unit 8.) 


Activity 1 Using sequence notation 


Consider the infinite sequence 

(bna) with terms 1, 4, 7, 10, 13, 16, 19, .... 
(a) Write down the values of bı and b4. 
(b) For which value of n is bn = 16? 


(c) Can you write down the value of bo? 


1.2 Closed forms for sequences 


The notation a, is useful when you want to specify a sequence by giving a 
formula for the terms. For example, suppose that you want to specify the 
infinite sequence of perfect squares: 


1, 4, 9, 16, 25, .... 
If you choose to represent this sequence using the letter s (for square), then 
= 17, s9=2?, o3=37, se = 4?, 


For a general natural number n, we have sp = n?, which is a formula for 
the general term, called the nth term, of the sequence. To complete the 
specification of the sequence, you need to state the range of values of the 
subscript n. You do this using brackets, as follows: 


Sn=n? (n=1,2,3,...). 


Note that it’s important to include the range of values of n here. It tells 
you that for this particular sequence the first term corresponds to n = 1 
and that the sequence continues indefinitely, rather than stopping after 
some number of terms. 


A formula like sn = n?, for defining a sequence in terms of the subscript n, 
is called a closed form (or a closed-form formula). A closed form for a 
sequence enables you to calculate any term of the sequence directly, once 
you’re given the value of n. Unfortunately, however, not all sequences have 
such a formula. 
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Closed form for a sequence 


A closed form for a sequence is a formula that defines the general 
term da, as an expression involving the subscript n. To specify a 
sequence using a closed form, two pieces of information are needed: 


e the closed form 


e the range of values for the subscript n. 


Here are some examples for you to try. 


Activity 2 Using closed forms 
For each of the sequences specified by the following closed forms and 
ranges of values of n, find the first five terms and the 100th term. 
(a) ün = m M= Lp 23e) (b) bn = Ln. {n=l 23] 
(gale (n=1,2,3,...) (d) dn =(-1)"n (n= 1,2,3,...) 
(6) en = (2) (e125 352.2) 
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Notice the device used in Activity 2(c)—(e) to specify a sequence whose 
terms alternate in sign (that is, are alternately positive and negative). The 
expression (—1)" has value —1 when n is odd and 1 when n is even, and 
the expression (—1)"*! has value 1 when n is odd and —1 when n is even. 
In part (e), (—2)” = (—1)"2”. 

If you’re given some terms of a particular sequence, then you may be able 
to spot a closed form for the sequence by recognising the pattern involved. 
It’s helpful to consider questions such as the following. 


e Is the nth term a constant multiple of n or of some fixed power of n? 


e Is the nth term a constant multiple of the nth power of some fixed 
number? 


e Do the terms alternate in sign? 


Activity 3 Spotting closed forms 


For each of the following sequences, given by the first four terms, try to 
spot a closed form for the sequence. Denote the nth term of the sequence 
by an in each case. Then use your closed form to find the 10th term in the 
sequence. 


(Don’t spend more than a short time attempting each part. Some of the 
closed forms are harder to spot than others.) 


(a) 1, 2, 3, 4, 20 (6) 2, 4, 8, 16,... (2) —1, 1, —1, 1, ... 
(d) 1, 8, 27, 64, ... (e) 6, —6, 6, —6, ... (f) 3,3, 3, 5, 
(g) 2, —4, 6, 8 ... 


When you write down a closed form for a sequence, you should usually also 
write down the subscript range, as demonstrated in the solution to 
Activity 3, so that you have fully specified the sequence. 


So far you’ve represented sequences using the subscript 1 for the first term. 
For example, you’ve seen that you can represent a sequence as 

a1, @2,03,.... This seems natural and easy to remember, but there are 
occasions when it is convenient to be flexible about how you represent the 
first term. Consider, for example, the sequence 


1, 2, 4, 8, 16,.... 
This is a sequence of powers of 2: 

OF oe eee 
You can specify this sequence in closed form, using a, = 2”, but only if 
you start numbering the subscripts from 0 rather than from 1: 

an=2" (n=0,1,2,...). 


In this case, the simplicity of the formula a, = 2” generally outweighs any 
inconvenience of starting with ap. There is the possibility of confusion in 


having ao as the first term of a sequence, with a; as second term, and so 
on, but with practice this should not cause difficulties. If it were 
considered essential to have the first term with subscript 1, then you could 
specify the sequence as follows: 

bal” (n=1,2,3,...). 
Then, for example, bı = 2)~! = 2° = 1. 
Further flexibility about the subscript for the first term of a sequence is 
sometimes useful, as illustrated in the next activity. 


Activity 4 Using a subscript other than 1 for the first term 


(a) For each of the following closed forms and ranges of values of n, find 
the first three terms of the sequence specified. 
0) an=3" (n=0,1; 2, coc) 

O 1 

~ n(n—1) 


‘at 1 
(ii) cn = aria 


(= 2, 34 e) 


(n=1,2,3,...) 


(b) For each of the following sequences, given by the first four terms, try 
to spot a closed form for the sequence, using the notation specified. 
Then use your closed form to find the sixth term in the sequence. 
(Don’t spend more than a short time attempting each part.) 


(i) 1, 3, 9, 27, ... (general term dn, first term do) 
1 1 1 1 
(ii) Bg gn (general term en, first term es) 
1 1 1 1 
(iii) T g Ie? 739? (general term fn, first term f2) 


The sequences in parts (a)(ii) and (a)(iii) of Activity 4 illustrate the fact 
that two sequences whose descriptions appear to be different at first sight 
can actually have exactly the same terms. This activity also highlights the 
importance of including the range of values of n in the definition of the 
sequence. 


It’s sometimes useful to convert a closed form for a sequence into a 
different closed form for the same sequence, which uses a different 
subscript for the first term. Here’s an example. 
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Cc) Example 2 Changing the subscripts used for a sequence 
o Consider the sequence given by the closed form 
C= Gi A ee ee 
For the same sequence, find a closed form 
by =... (n=0,1,2,...). 
Solution 


@. The value of n corresponding to each term is reduced by 1 in 
moving from (an) to (bn), so you need to replace each occurrence of n 
in the formula for the term by n + 1, to leave the term unchanged. ® 


The required closed form is 
bn = (n+ 1)\(n+1+2) (n=0,1,2,...); 
that is, 
na = tS) S Ve) 
@. As a check, the closed form for (an) gives 
Gy Sid, CohS2<4, daSe x} SS, 
The closed form obtained for (bn) gives 
MHI M1 BHBX4, Y= 3X5, 


which is the same sequence. ® 


Activity 5 Changing the subscripts used for sequences 


(a) For each of the sequences (an) given by the following closed forms, 
write down a closed form bn =... (n=0,1,2,...) that specifies the 
same sequence. 


(i) an=2n (n=1,2,3,...) 

(ü) t=O) (n=1,2,3,...) 
(iii) an=5+n (n=1,2,3,...) 
(iv) an = 4(n—- 1) (n=1,2,3,...) 


(b) For each of the sequences (an) given by the following closed forms, 
write down a closed form bn =... (n = 1,2,3,...) that specifies the 
same sequence. 


(i) an =04" (n=0,1,2,...) 
ii) dp = om (eo =O l; 2e) 


2 


( 

és 1 

(iii) an == (n=0,1,2,...) 
(iv) an =2+3n (n=0,1,2,...) 


10 


(c) For each of the sequences (an) given by the following closed forms, 


write down a closed form bn =... (n = 2,3,4,...) that specifies the 
same sequence. 
3r 
(i) an n + 1 (n bd , ? ) 
1 
(ii) an (m= 0, 12 ac) 


~ (n + 1)(n + 3) 


It’s implicit in what we’ve done so far that where a simple pattern is 
evident from the first few terms of a sequence, that pattern is assumed to 
continue unchanged. For example, in Activity 3(d) you were asked to find 
a closed form for the sequence 1,8, 27,64,.... The answer 


an =n? (n=1,2,3,...) 


was found by spotting that the first four terms are the cubes of 1, 2,3, 4, 
respectively, and assuming that this pattern continues. However, 
alternative closed forms can be written down that give the same first four 
terms; one of these is 


bn = n? + (n — 1)(n— 2)(n—3)(n—4) 8") 


The terms of the sequence (bn) differ from those of (an) from the fifth term 
onwards, so these are different sequences. In general, no finite number of 
terms can describe a sequence without ambiguity, whereas a sequence 
described by a closed form with a subscript range is unambiguously 
defined. Where sequences are described by giving the first few terms, we 
shall always assume that any simple pattern that is evident continues for 
the remainder of the sequence. 


1.3 Recurrence relations for sequences 


Some sequences have the property that each term (after the first) can be 
obtained from the previous term by using a formula. For example, consider 
the sequence with closed form an = 7n (n = 1,2,3,...). Its terms are 


7, 14, 21, 28, 35, .... 


Each term (after the first) of this sequence can be obtained from the 
previous term by adding 7: 


az =a, +7, ag=agt+7, a@a4=đa3 +7, 


If a term (after the first) has subscript n, then the previous term has 
subscript n — 1, so you can write 


an Opa? (n= 2 3A): 


Note that the range of values of n here begins with 2 rather than 1, 
because 2 is the first value of n for which the equation an = an-1 +7 
applies. 
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In a similar way, the sequence with closed form b, = 2” (n = 0,1,2,...), 
whose terms are 


1, 2, 4, 8, 16, ..., 


has the property that each term (after the first) is always twice the 
previous term: 


bı = 2bọ, b2 = 2b,, b3 = 2bo, 
Hence you can write 
bn = bn- (m= 123a): 


A formula like an = an—1 +7 or bn = 2bn—1, which allows each term of a 
sequence (after the first) to be obtained from the previous term, is called a 
recurrence relation. It’s more specifically a first-order recurrence 
relation because it involves only the immediately preceding term of the 
sequence. (A second-order recurrence relation would involve the preceding 
two terms, and so on.) In this module, the words ‘recurrence relation’ 
always refer to a first-order recurrence relation, unless otherwise stated. 


If you have a recurrence relation for a sequence, and you also know the 
first term of the sequence, then in principle you can determine any term of 
the sequence by starting from the first term and repeatedly applying the 
recurrence relation. For example, from the recurrence relation 


Tn = z2; (n=2,3,4,...), 
and the first term xı = 2, you can successively calculate 
ry = 2, 
r= a =2? —4, 
z3 = r2 = 4° = 16, 
z4 = x = 16° = 256, 
and so on. Here the expression z2, for example, means (2 )?, the square 
of x}. 


Notice that if you keep the same recurrence relation but change the first 
term, zı, then you obtain a different sequence; for example, with xı = 1, 
the recurrence relation above gives 72 = 1, #3 = 1, z4 =1,.... 


Taken together, the specification of a first term, a recurrence relation and 
the range of values of n for which the recurrence relation applies is called a 
recurrence system, and the resulting sequence is called a recurrence 
sequence. We display the three parts of a recurrence system as follows: 


zı = 2; Tn Soe (n=2,3,4,...); 


the first term of the sequence is on the left, and the recurrence relation and 
the range of values of n are on the right. If the first term of the sequence is 
labelled not as xı but as xo, say, then the range of values of n has to begin 
with 1 rather than 2, as follows: 


ey = 2, taat 4 m= 1,9,3.40.): 
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Recurrence system for a sequence 


A recurrence relation for a sequence is an equation that defines 
each term other than the first as an expression involving the previous 
term. To specify a sequence using a recurrence system, three pieces 
of information are needed: 


e the value of the first term 
e the recurrence relation 


e the range of values for the subscript n. 


Example 3 Using recurrence systems for sequences o) 


For each of the following recurrence systems, find the first five terms — 
of the sequence specified. 
(a) o = 1; ia = Om a mD o= aaa) 

1 


(b) i=2, bn => Ce 


n—1 
Ce) e= Cp nnn Wr 1 28a) 
Solution 
@. Apply the recurrence relation for each value of n in turn. ® 
(a) ay = 1 
ag = i CON) ae is 
a3 =9xXa2—-—2=13 
C—O a — 05) 
Os =) XGA = 2 Sal 


Œ) w 2 
b =1/b1 = 4 
bs = Io = 2 
b4 = 1/b3 = 4 
bs = Io = 2 
le) eq = tl 


a= lkol 
Cols x 0 — 2 
G =I G= 
GHA Xx & = H 
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Activity 6 Using recurrence systems 


For each of the following recurrence systems, find the first five terms of the 
sequence specified. 


(a) ay =0, an = 2an—-1 +1 (n =2,3,4,...) 
(b) b =1, bn =b2_,-1 (n=2,3,4,...) 
(c) 29 = 2, Cn = $ (Cn—1 +2/cn-1) (w= 1,2,3,.-<) 


In part (c), round the terms to six decimal places when writing them 
down, but maintain full calculator precision when calculating each term. 


The sequence in Activity 6(c) has the property that it very rapidly gives 
good approximations to v2, as you can check by finding v2 on your 
calculator. A proof of this property is outside the scope of this module, 
but it illustrates an important use for recurrence systems, namely, that 
sequences defined by using recurrence systems can sometimes be used to 
calculate approximations to certain irrational numbers to as many decimal 
places as may be required. 


The next box summarises what you have seen in this section. 


Three ways to specify a sequence 
A sequence (an) can be specified by giving one of the following. 


e The values of the first few terms, if we assume that any simple 
pattern that is apparent continues. 


e A closed form, 
an = expression in n, 
and a subscript range. This permits the value of a, to be found 
directly for any value of n in the subscript range. 


e A recurrence system, consisting of the value of the first term, a 
recurrence relation, 


an = expression involving an_1, 


and a range of values of the subscript n. This permits the value 
of a, to be found from the value of an-ı for any value of n in the 
subscript range. 


It’s often (though not always) convenient to take the first subscript of a 
sequence to be 1, so to avoid constant repetition we adopt the following 
convention. 


2 Arithmetic and geometric sequences 


Convention 


The first term of a sequence (an) is taken to be a; unless indicated 
otherwise. 


In the next section, you’ll study two particular types of recurrence system, 
and investigate whether closed forms can be found for the corresponding 
sequences. Many sequences defined by recurrence systems do have closed 
forms, but not all. 


2 Arithmetic and geometric sequences 


In this section, we consider two types of sequence that occur frequently in 
practice. 


2.1 Arithmetic sequences 


We begin with two sequences that arise in different ways but are of a 
similar mathematical type. 


First, consider the finite sequence 
7.15, 10.50, 13.85, 17.20, ..., 245.00. 


This sequence could represent the heights in metres above ground level of 
successive floors in a very tall building, from the first floor upwards. 

(The terms are similar to the heights in metres of the habitable floors of 
the Shard, in London.) We’ll call this sequence (hn), so hy = 7.15, 

hə = 10.50, h3 = 13.85, and so on. 


To get from any term in this sequence to the next, we add the same 
number each time: 


7.15 + 3.35 = 10.50, 
10.50 + 3.35 = 13.85, 


The Shard, with St Paul’s 
13.85 + 3.35 = 17.20, Cathedral on the left 


and so on. The number 3.35 occurs here because it is the height difference 
in metres between successive floors. Thus this sequence can be defined by 
the recurrence system 


hy =7.15, hn =hn-1 +3.35 (n =2,3,4,...,72). 


The last value in the range of n here is 72 because, as will be confirmed 
later in this subsection, the last term in the sequence is h72. (The 72nd 
floor of the Shard is about 245 metres above ground level. The total height 
of the building is 310 metres.) 
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Next, consider the finite sequence 
1000, 950, 900, 850, ..., 0. 


This sequence could represent the volume, measured in litres on successive 
Saturdays, of oil in a tank supplying a boiler that uses 50 litres of oil per 
week. We’ll call this sequence (vn), so vı = 1000, v2 = 950, v3 = 900, and 
so on. 


Once again, to get from any term in this sequence to the next, we add the 
same number each time: 


1000 + (—50) = 950, 
950 + (—50) = 900, 
900 + (—50) = 850, 


and so on. The negative number —50 occurs here because each week the 
volume in the tank is reduced by 50 litres. Thus this sequence can be 
defined by the recurrence system 


vı = 1000, Un =Un-1 — 50 (m=2, 34, -,21). 


The last value in the range of n here is 21 because 1000 litres at 50 litres 
per week lasts 20 weeks, so the last term in the sequence is v21 = 0. 


Any sequence with the structure demonstrated above — the addition of a 
fixed number to obtain the next term — is called an arithmetic 
sequence, or alternatively an arithmetic progression. (In this context, 
‘arithmetic’ is pronounced with emphasis on the syllable ‘met’.) Thus a 
general arithmetic sequence is given by the recurrence system 


r= 4, = et tad m= 28 ess) 


where a is the first term and d is the number that’s added to each term to 
give the next term. That is, d is the difference £n — 2,_ ; between any pair 
of successive terms, usually called the common difference of the 
sequence. Choosing the values of the first term a and the common 
difference d determines a particular arithmetic sequence; we call a and d 
the parameters of the arithmetic sequence. For example, the floor heights 
sequence has parameters a = 7.15 and d = 3.35, and the oil volumes 
sequence has parameters a = 1000 and d = —50. 


An arithmetic sequence (£n) can be finite, as for the floor heights sequence 
and the oil volumes sequence, or infinite. Also, the first term can be zo 
rather than xı, in which case the sequence is given by 


zo =a, a= tni ba (a= 1; 23st): 


Activity 7 Recognising arithmetic sequences 


Which of the following recurrence systems define arithmetic sequences? 
For each arithmetic sequence, write down the values of the first term a and 
common difference d. 


(a) zı = —1, TEn = Tn-1 +1 (n= 2,3,4,...,100) 


2 Arithmetic and geometric sequences 


(b) y =2,  Yn=-—Yn-1 +1 (n=2,3,4,...) 
(c) 29=1, Zn = eo — 1.5 (AH 1,2,3,...) 


Suppose now that you have been given the first few terms of an arithmetic 
sequence (zn). How can you find its parameters a and d? Well, a is just 

the first term, and d is the difference £n — £n—1 between any two successive 
terms £n and £n—1 in the sequence, as illustrated in the following example. 


Example 4 Finding parameters of arithmetic sequences Cc) 


(a) For the infinite arithmetic sequence (£n) whose first four terms 
are 
100, 95, 90, 85, 
find the values of the first term a and common difference d, and 
write down the corresponding recurrence system. Calculate also 
the next two terms of the sequence. 


(b) Repeat part (a) for the infinite arithmetic sequence (yn) whose 
first four terms are 


ee 3) 
Ty go ad 
Solution 


@. By the convention stated on page 15, take the first term of each 
sequence to have subscript 1. 


Remember that to specify a recurrence system, you have to write 
down three things: the first term, the recurrence relation and the 
subscript range. ® 


(a) The first term is a = 100, and the common difference is 
d = 95 — 100 = —5. So a suitable recurrence system is 


a = OO, =F) S 
The next two terms are 
te= m D= DSI 
ze = £5 — 5 = 80 — 5 = 75. 
(b) The first term is a = 4, and the common difference is 
d= 5 — i = i So a suitable recurrence system is 
yl =F, Yn = Yn-1 +4 (m = 2 34a oa) 


The next two terms are 


Ys =ystq=l+ z= 
yo=ystg=Rtaa 


viw eI 
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Activity 8 Finding parameters of arithmetic sequences 


(a) For the infinite arithmetic sequence (xn) whose first four terms are 
1, 4, 7, 10, 


find the values of the first term a and common difference d, and write 
down the corresponding recurrence system. Calculate also the next 
two terms of the sequence. 


(b) Repeat part (a) for the infinite arithmetic sequence (yn) whose first 
four terms are 


2.1, 3.2, 4.3, 5.4. 


(c) Repeat part (a) for the finite arithmetic sequence (zn) with eleven 
terms, whose first four terms are 


1, 0.9, 0.8, 0.7. 


A special type of arithmetic sequence arises when the common difference d 
is zero. In this case, each term of the sequence is equal to the first term. 
Such a sequence is called a constant sequence. For example, 


3, 3, 3, ... 


is a constant sequence. 


Finite arithmetic sequences 


As you’ve seen, the method of finding the values of the parameters a and d 
is the same for any arithmetic sequence, whether it’s finite or infinite. For 
an infinite sequence, the recurrence system has the form 


ti =ü; In = in-i ta (m=2 3 Ass); 


whereas for a finite sequence, with N terms say, the recurrence system has 
the form 


r1=4, CHW (= 23 


Here the final number in the range of values of n is N because the final 
term xy is obtained by applying the recurrence relation with n = N. 


If you have the first few terms and the last term of a particular finite 
arithmetic sequence, and you want to find a recurrence system that 
specifies the sequence, then you need to find not only the values of the 
parameters a and d, but also the subscript of the last term. To see how to 
do this, consider the floor heights sequence, 


7.15, 10.50, 13.85, 17.20, ..., 245.00. 


To find how many terms there are in the sequence, you need to work out 
how many times the common difference 3.35 has been added to the first 


2 Arithmetic and geometric sequences 


term 7.15 to produce the last term 245.00. The total amount added is 
245.00 — 7.15, so the number of times that 3.35 has been added is 


245.00 — 7.15 
——_.——_ = 71. 
3.35 
So there are 71 terms of the sequence after the first term, and hence there 


are 72 terms altogether. The corresponding recurrence system for the floor 
heights sequence is therefore 


hy =7.15, 9 An =hn-1 +3.35 (n= 2,3,4,...,72), 
as stated earlier. In general, the number of terms of a finite non-constant 
arithmetic sequence is given by 


last term — first term 
number of terms = ——————— 
common difference 


That is, if a finite arithmetic sequence has N terms, with first term z1, last 
term zy and common difference d, then 


eee pall 1 
te (1) 


You'll meet this equation in a slightly different form in the next subsection. 


Activity 9 Finding the number of terms in a finite arithmetic 
sequence 
(a) Find the number of terms in the finite arithmetic sequence 
1000, 970, 940, 910, ..., 10. 


(b) Hence write down a recurrence system for this sequence, denoting the 
nth term by £n. 


2.2 Closed forms for arithmetic sequences 


Arithmetic sequences have a particularly simple form: to get from one 
term to the next, you add the same number each time. This pattern allows 
you to obtain closed forms for such sequences, which makes them easier to 
handle mathematically. 


To see how to do this, consider the sequence 
5, 8, 11, 14, ..., 38. 


This is a finite arithmetic sequence with parameters a = 5, d = 3 and 
12 terms, which can be described by the recurrence system 


by =5, bn = bn-1 +3 (= 28 Aare 2): 


So bı = 5, b2 = 8, and so on, up to b12 = 38. The way in which the terms 
of this sequence are obtained from the recurrence relation can be pictured 
as follows. 
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add 3 add 3 add 3 add 3 


a 8 a: uU n 
bi bə bs bn 


Figure 1 Obtaining the terms of an arithmetic sequence 
Starting from bı = 5, 

to obtain b2, we add 3, 

to obtain b3, we add 3 twice, 

to obtain b4, we add 3 three times, 


and so on. In each case the number of added 3s is one fewer than the 
subscript on the left. Thus, to obtain the general term bn, you have to 
add 3 exactly n — 1 times; that is, you add 3(n — 1). This gives the value 
of the general term as 


bn =5+3(n—1) (n=1,2,3,...,12), 
which can be simplified to 
by = 3n 2 n = 1/253) vas p12), 


(Notice that the first value in this range of n is 1, whereas the first value in 
the range of the recurrence relation was 2.) 


For example, using this closed form we find that b4 = 3 x 4 + 2 = 14, as 
expected. 


The reasoning above can be applied to a general arithmetic sequence, to 
obtain a formula for the nth term. Consider the arithmetic sequence given 
by the recurrence system 


=a, t= Bea ee nS 28a) 


Figure 2 shows how each successive term is obtained from the term before. 


add d add d add d add d 


le le oe n =| a 
Ti T2 T3 Tn 


Figure 2 Obtaining the terms of a general arithmetic sequence 


To obtain a general term £n, you start with zı = a and add d exactly 
n — 1 times, so £n =a + (n — 1)d. 
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Closed form for an arithmetic sequence 
The arithmetic sequence with recurrence system 

i = Op aan Se ee t= 2, 3.4) 
has the closed form and subscript range 


f,=atin— Nd (a= 1,253,...): 


If you have a finite arithmetic sequence, with N terms say, then it has the 
closed form stated in the box above, but a finite subscript range: 


In =a+(n—l1)d (n= 1,2,3,..., N). 
It’s easy to check that the formula in the box above gives the correct 


answers for the first few terms. For example, when n = 1 and n = 2, the 
formula gives the correct values xı = a and rg = a + d. 


You can simplify the expression a + (n — 1)d in the closed form for an 
arithmetic sequence when a and d have particular values. For example, 


5 +3(n—1) =3n4+2. 


The next example and activity are about applying the closed form in the 
box above to particular sequences. 


Example 5 Finding a closed form for an arithmetic sequence A 


Find a closed form for the arithmetic sequence 5, 9,13, 17,..., given — 
by the recurrence system 


= By, yan (aH 2 


Check that your answer gives the correct value for the fourth term, 
and also calculate the 10th term of the sequence. 


Solution 


@. Apply the closed-form formula zn = a + (n — 1)d 
(i= 12,3) 


Since a = 5 and d = 4, the closed form is 
In = 5+4(n — 1) 
= ey 2 8a) 
This gives z4 = 4 x 4 + 1 = 17, as expected, and also 
feng = 4) >< IQ +1=41 
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Activity 10 Finding a closed form for an arithmetic sequence 
(a) Find a closed form for the arithmetic sequence 1,4,7,10,..., given by 
the recurrence system 
r=1, Tnm Brzi tS a a ee! ee 


Check that your answer gives the correct value for the fourth term, 
and also calculate the 10th term of the sequence. 


(b) Repeat part (a) for the arithmetic sequence 2.1, 3.2, 4.3,5.4,..., given 
by the recurrence system 


yı = 2.1, Yn = n-i tll (= 2, 3 Aes: 


(c) Repeat part (a) for the arithmetic sequence 1, 0.9,0.8,0.7,...,0, given 
by the recurrence system 


z= l, n= Fat — OL. (w= 234l): 


(You were asked to write down the recurrence systems for these sequences 
in Activity 8.) 


Note that if a finite arithmetic sequence has last term æy, then the closed 
form that you’ve met gives 


TN =a+(N —1)d, 
which can be written as 
ZN = LiF (N — 1)d. 


This is a rearrangement of equation (1) on page 19, so the two formulas 
are really just saying the same thing. 


Finally, note that an arithmetic sequence with first term a and common 
difference d has an alternative closed form that’s sometimes useful. The 
closed form that you’ve met, 


In =a +(n—l1)d (a= 1,2,3,...), 


holds when the subscript n takes values starting from 1. If instead you 
choose to have the subscript n start from 0, then the closed form is 


Yn =atnd (n=0,1,2,...). 
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2.3 Geometric sequences 


Next we investigate a different type of sequence. We begin once again with 
two sequences from real-world contexts. 


First, consider the savings account sequence, 
1000.00, 1050.00, 1102.50, 1157.63, 1215.51. 


This represents the amount of money (in £) in a savings account on each 
successive 1 January over a five-year period. As pointed out in 

Subsection 1.1, the terms of this sequence after the third term have been 
rounded to two decimal places, since they are amounts of money and hence 
need to be expressed to the nearest penny. The corresponding sequence 
with unrounded values is 


1000, 1050, 1102.5, 1157.625, 1215.506 25. 


We shall call this unrounded sequence (sn), so sı = 1000, s2 = 1050, 
s3 = 1102.50, and so on. 


To get from any term in this sequence to the next, we multiply by the 
same number each time: 


1.05 x 1000 = 1050, 
1.05 x 1050 = 1102.50, 
1.05 x 1102.50 = 1157.625, 


and so on. The number 1.05 occurs here because the interest added at the 
end of each year is 0.05 times (that is, 5% of) the amount in the account 
at the beginning of the year. Thus this sequence can be defined by the 
recurrence system 


sı = 1000, sna = LUSsg4 (= 2,3,4,5): 


Here the range of values of n stops at 5, since the last term in the sequence 
is ss. Note that, where rounding is required, you should carry out the 
complete calculation for each term using exact arithmetic first and round 
only at the end, rather than rounding at each application of the recurrence 
relation. 


Next, consider the sequence 
2000, 1400, 980, 686, 480.2, .... 


This could represent the heights, measured in millimetres, of successive 
bounces of a ball that is assumed to rebound to 70% of the height from 
which it falls. We shall call this sequence (hn), so hy = 2000, hg = 1400, 
h3 = 980, and so on. 


Once again, to get from any term in this sequence to the next, we multiply 
by the same number each time: 


0.7 x 2000 = 1400, 
0.7 x 1400 = 980, 
0.7 x 980 = 686, 
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and so on. The number 0.7 occurs here because each successive height is 
70% of the previous one. Thus this sequence can be defined by the 
recurrence system 


hı = 2000, hn =0.7hn-1 (n=2,3,4,...), 


where we have assumed (unrealistically) that the ball will bounce infinitely 
many times. 


Any sequence with the structure demonstrated above — multiplication by a 
fixed number to obtain the next term — is called a geometric sequence, 
or alternatively a geometric progression. Thus a general geometric 
sequence is given by the recurrence system 


v1 =q, i=in (m= 23A) 


where a is the first term and r is the number by which you multiply each 
term to obtain the next term. That is, r is the constant ratio £n/£n—1 of 
any two successive terms, often called the common ratio of the sequence. 
Choosing the values of the first term a and common ratio r determines a 
particular geometric sequence; we call a and r the parameters of the 
geometric sequence. For example, the savings account sequence (sn) has 
parameters a = 1000 and r = 1.05, and the bouncing ball sequence (hn) 
has parameters a = 2000 and r = 0.7. 


A geometric sequence (£n) can be finite, as for the savings account 
sequence, or infinite, as for the bouncing ball sequence. The first term can 
be zo rather than xı, in which case the sequence is given by the recurrence 
system 


xo =Q; HT =L Byes) 
Notice that when r = 1 we obtain the constant sequence a,a,a,..., SO 


constant sequences are not only a special type of arithmetic sequence, as 
you saw earlier, but also a special type of geometric sequence. 


Activity 11 Recognising geometric sequences 


Which of the following recurrence systems define geometric sequences? For 
each geometric sequence, write down the values of the first term a and 
common ratio r. 


(a) zı = —1, Ce = 31 (NE A 3A) 
(b) yo = 1, Yn = —0.9yn-1 (n=1,2,3,...) 
(c) 2 = 2, Zn = —2n-1 +1 (n=2,3,4,...) 


Suppose now that you know the first few terms of a geometric 

sequence (zn) and you want to find its parameters. As with arithmetic 
sequences, the parameter a is just the first term. The other parameter, r, 
is the ratio of any pair of successive terms. 


The next example and activity are about finding the parameters for some 
geometric sequences, and so obtaining recurrence systems for these 
sequences. 
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Example 6 Finding parameters of geometric sequences A 
(a) For the infinite geometric sequence (£n) whose first four terms are — 
2 -8 La = 6.70. 
find the values of the first term a and common ratio r, and write 


down the corresponding recurrence system. Hence calculate the 
next two terms of the sequence, to three decimal places. 


(b) Repeat part (a) for the infinite geometric sequence (yn) whose 
first four terms are 


100, 99, 98.01, 97.0299, 
calculating the next two terms to three significant figures. 
Solution 


@. Using the convention stated on page 15, take the first term of each 
sequence to have subscript 1. 


Remember that to specify a recurrence system, you have to write 
down three things: the first term, the recurrence relation and the 
subscript range. ® 


(a) The first term is a = 2, and the common ratio is 
r = (—3)/2 = —1.5. So the recurrence system is 


ei = 2. C= or ee — 2a ae 
The next two terms are 
os = loan — 1.5 (6:75) = 10.125 
x6 = —1.545 = —1.5 x 10.125 = —15.1875 = —15.188 (to 3 d.p.). 


(b) The first term is a = 100, and the common ratio is 
r = 99/100 = 0.99. So the recurrence system is 
u = O O00 eae = 2 ones 
The next two terms are 
Ys = 0.99y4 
= 0.99 x 97.0299 
= 96.059 601 = 96:1 (to 33.1), 
UB = 0.99y5 
= 0.99 x 96.059 601 
= 95.099004... = 95.1 (to 3 s.f.). 


25 


Unit 10 Sequences and series 


26 


Activity 12 Finding parameters of geometric sequences 


(a) For the infinite geometric sequence (£n) whose first four terms are 
1, $, F, $, 
find the values of the first term a and common ratio r, and write down 
the corresponding recurrence system. Calculate also the next two 
terms of the sequence. 


(b) Repeat part (a) for the infinite geometric sequence (yn) whose first 
four terms are 


4.2, 7.14, 12.138, 20.6346. 


Give the next two terms to three decimal places. 


(c) Repeat part (a) for the infinite geometric sequence (zn) whose first 
four terms are 


2, —2, 2, —2. 


Finite geometric sequences 


The method of finding values for the parameters a and r is the same for 
any geometric sequence, whether it is finite or infinite. However, for a 
finite sequence you may need to find how many terms there are, as you did 
for finite arithmetic sequences. 


For example, suppose that the savings account sequence is changed by 
lengthening the period over which the account balance accumulates, giving 
this sequence of annual balances: 


1000.00, 1050.00, 1102.50, 1157.63, ..., 2078.93. 


Here, as before, the amounts are rounded, after their calculation, to two 
decimal places (that is, to the nearest penny, since the amounts are 
measured in £). As before, the interest rate is 5% per year, and hence each 
term (before rounding) is 1.05 times the previous term. How can you find 
the number of years for which the money is kept in the account? In other 
words, how many terms are there in the sequence? 


Apart from the rounding that takes place, this is a finite geometric 
sequence with first term 1000, last term 2078.93 and common ratio 1.05. 
In order to determine how many terms there are in the sequence, N say, 
you need to find how many times sı = 1000 has to be multiplied by 1.05 in 
order to obtain sy = 2078.93. Since progressing from sı to sy involves 

N — 1 multiplications by 1.05, we have 


1000 x 1.0571 = 2078.93. 


This is an exponential equation of a type that you saw how to solve in 
Subsection 4.4 of Unit 3. Dividing through by 1000 and taking the natural 
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logarithm of both sides gives 


2078.93 
1.0571 = =2, 
a 07893 
(N — 1) In 1.05 = In 2.078 93 
In 2.078 93 
N =1+ = - 16. to 4d.p.). 
+ n LOS 6.0000 (to 4d.p.) 


(Recall that the value of sy has been rounded to two decimal places, so it 
is not surprising that the outcome for N is not exactly an integer.) 


Hence there are 16 terms in the sequence, and the corresponding 
recurrence system is 


sı = 1000, s,=1.05s,-1 (n=2,3,4,...,16). 


The same argument can be applied in the general case. If the finite 


geometric sequence £1, 2, £3,..., £y has common ratio r, then you obtain 
zy from zı by multiplying by r exactly N — 1 times. That is, 
TN = xır =e 


So you can find the number of terms, N, in the sequence by solving the 
following equation for N: 

rN le 2N. 

Tı 

You can always do this by using logarithms, as above (except in the simple 
special cases where x; = 0, r = 0 or r = +1). However, care is needed if 
r <0. In this case, you can find N by solving the equation 
|r|N-+ = |2n /x1| by using logarithms. 


Here is an example for you to try. 


Activity 13 Finding the number of terms in a finite geometric 
sequence 


(a) Find the number of terms in the finite geometric sequence (zn) whose 
terms are 
7, 35, 175, 875, ..., 2734375. 


(b) Hence write down a recurrence system for this sequence. 


The situation posed above for the savings account, in which sy = 2078.93 
is known but N is unknown, is somewhat artificial. A more realistic 
question is: how many years will it take for an initial sum of £1000, placed 
in a savings account paying 5% interest per year, to double in value? In 
other words, what is the smallest value of N for which sy > 2000? 


This question can be answered with a similar approach to that above, by 
solving the equation 


1000 x 1.05%~! = 2000. 
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The solution is given by 
In2 
In 1.05 


This shows that when N = 15 the account balance has not yet reached 
twice its initial value, whereas when N = 16 it has more than doubled. 


N=1+ = 15.2067 (to 4d.p.). 


2.4 Closed forms for geometric sequences 


Geometric sequences, like arithmetic sequences, have a particularly simple 
form: to get from one term to the next, you multiply by the same number 
each time. This pattern allows you to obtain a closed form for such 
sequences. 


To see how to do this, consider the bouncing ball sequence, 
2000, 1400, 980, 686, 480.2, .... 


This is a geometric sequence with parameters a = 2000 and r = 0.7, which 
can be defined as 


hı = 2000, Ryn =0.7hn-1 (n=2,3,4,...). 


The way in which the terms of this sequence are obtained from the 
recurrence relation can be pictured as follows. 


x 0.7 x 0.7 x 0.7 x 0.7 
2000 }~ m |1400 —s| 980 | —a ee a-o 
hy ho hs ae, 


Figure 3 Obtaining the terms of the bouncing ball sequence 
Starting from hı = 2000, 

to obtain hg, you multiply by 0.7, 

to obtain h3, you multiply by 0.7 twice, i.e. by 0.77, 

to obtain h4, you multiply by 0.7 three times, i.e. by 0.7°, 


and so on. To obtain the general term hn, you have to multiply by 0.7 
exactly n — 1 times; that is, you multiply by 0.7”~!. This gives the value 
of the general term as 2000 x 0.7"~!, so the closed form is 

hn = 2000 x 0,7""* (n =1,2,3,...). 
For example, using this closed form you find that h4 = 2000 x 0.73 = 686, 
as expected. 


The reasoning above can be applied to a general geometric sequence to 
obtain a formula for the nth term. Consider the geometric sequence given 
by the recurrence system 


Ti =, a= Pipa, ME 3A a) 
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Figure 4 shows how each successive term is obtained from the term before. 


xT xr XT ASR 
oo ae uU 
Li T2 T3 Tn 


Figure 4 Obtaining the terms of a general geometric sequence 


To obtain a general term £n, you start with zı = a and multiply by r 
exactly n — 1 times, so £n = ar™"!. 


Closed form for a geometric sequence 
The geometric sequence with recurrence system 

n — a, a n ee) 
has the closed form and subscript range 


eee (m= 122e) 


If you have a finite geometric sequence, with N terms say, then it has the 
closed form stated in the box above, but a finite subscript range: 


£n =ar"! (n= 1,2,3,..., N). 


Once again, you can check that this formula gives the correct answers for 
the first few terms. For example, in the cases n = 1 and n = 2, the formula 
gives the correct values xı = a and z2 = ar. Also, if the ratio is r = 1, then 
you obtain the constant sequence £n = a (n = 1,2,3,...), as expected. 


The next example and activity are about applying the closed form in the 
box above to particular sequences. 


®© 
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Activity 14 Finding a closed form for a geometric sequence 


(a) Find a closed form for the geometric sequence 1, 5; i; $, ..., given by 
the recurrence system 
zı =l, Cn = itn- (m= 234a a): 


Check that your answer gives the correct value for the fourth term, 
and also calculate the 10th term of the sequence (correct to four 
significant figures). 


(b) Repeat part (a) for the geometric sequence 
4.2, 7.14, 12.138, 20.6346, ..., given by the recurrence system 


yı = 4.2, Yn =L Tyn- (n= 2,3; fes): 
(c) Repeat part (a) for the geometric sequence 2, —2, 2, —2, ..., given by 
the recurrence system 
Zz = 2, Zn = — ep (n = 2,3,4, oe): 


(You were asked to write down the recurrence systems for these sequences 
in Activity 12.) 


Finally, note that the geometric sequence with first term a and common 
ratio r has an alternative closed form that’s sometimes useful. The closed 
form that you’ve met, 


a an (co ey ee eae 8 


holds when the subscript n takes values starting from 1. If instead you 
choose to have the subscript n start from 0, then the closed form is 


yn= ar" (m= 01,2): 


3 Graphs and long-term behaviour 


3 Graphs and long-term behaviour 


In this section you’ll see how the information contained in a sequence can 
be plotted on a graph. You'll then look at what can be said about the 
behaviour of a sequence after a large number of terms. 


3.1 Graphs of sequences ea 
In Unit 3 you were introduced to the idea of a function. You can think of a 
any sequence as a function whose domain is the set of natural numbers 

{1,2,3,...}. For example, you can think of the sequence 


1, 4, 7, 10, 13, ... 


as the function for which the input 1 gives the output 1, the input 2 gives Ai oes 


the output 4, the input 3 gives the output 7, and so on, as shown in the 


mapping diagram in Figure 5. Figure 5 A sequence viewed 


: , , as a function 
In general, the sequence (xn); defines a function for which each input 


number n gives the output zn, as shown in Figure 6. If you want the 

subscript of the first term of the sequence to be 0 rather than 1, then you 

should take the domain of the function to be {0,1,2,...} rather than ae. 
{1,2,3,...}. You can make similar adjustments for other possible ranges of 


EF 
subscripts. fei 
a 


Functions can be represented by graphs, and this is in particular true of 
sequences. Each term £p corresponds to a point (n, £n) on the graph. The 
next example demonstrates how to plot a graph for a sequence. 


inputs outputs 


Figure 6 A general sequence 
viewed as a function 
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Activity 15 Plotting the graph of an arithmetic sequence 


Plot a graph for the first six terms of the arithmetic sequence given by the 
recurrence system 


z4=1, Zn = %m-1 — 0.1 (n = 2,3, 4500) 


Notice that, in each of Example 8 and Activity 15, the graph of the 
sequence consists of points that lie on a straight line. This happens for 
every arithmetic sequence. To see why, consider the arithmetic sequence in 
Example 8, which is given by 


ta =n. = Leese h: 


This equation defines £n as a linear function of n, as introduced in 
Subsection 1.6 of Unit 3. So it is the equation of a straight line, with 

n and £n in place of the usual variables x and y, respectively. You can 
read off the gradient, 3, in the usual way. (You can also read off the 
vertical intercept, but it has no relevance here since 0 is not in the domain 
of the function.) However, the graph of the arithmetic sequence consists 
only of those isolated points on the line that have first coordinate 

n = 1,2,3,..., rather than the whole straight line. This graph and the 
straight line on which it lies are shown in Figure 7. 


3 Graphs and long-term behaviour 


EnA 


20-4 


Q—2>_—____—— a= ee 
(aes a a A a Mes 


Figure 7 The graph of an arithmetic sequence lying on a straight line 


In general, the arithmetic sequence with first term a and common 
difference d has the closed form 


Zn =a+(n—l1)d (n= 1,2,3,...), 
which can be rearranged as 
In = dn + (a-d) (n=1,2,3,...). 
So its graph consists of points that lie on a straight line with gradient d. 


Note that in the particular case d = 0, the straight line has gradient zero 
and so is horizontal. Correspondingly, the arithmetic sequence is a 
constant sequence, a, a, @,.... 


We turn next to the graphs of geometric sequences. 
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In each part of the next activity, your first task is to calculate the terms 
for which the corresponding points are to be plotted. 


Activity 16 Plotting the graphs of geometric sequences 


(a) Plot a graph for the first six terms of the sequence given by the 
recurrence system 


yı = 4.2, Yn = l.Tyn-1 (n= 2,3,4,...). 


(b) Repeat part (a) for the first five terms of the sequence given by the 
closed form 


Ge =E 1983202), 


In Example 9 the common ratio of the geometric sequence is r = z and the 
graph is decreasing, whereas in Activity 16(a) the common ratio is r = 1.7 
and the graph is increasing. In each case, the graph consists of points that 
lie on the graph of an exponential growth or decay function. You saw in 
Subsection 4.6 of Unit 3 that an exponential growth or decay function is a 
function of the form f(x) = ae**, where a and k are non-zero constants. 
Such a function can also be written in the form f(x) = ab”, where a and b 
are constants with a Æ 0, b > 0 and b Æ 1. If b> 1, then f is an 
exponential growth function, whereas if 0 < b < 1, then f is an exponential 
decay function. 


For example, consider the sequence from Example 9. It has the closed form 
= aye r= 1523s 


which can be rearranged as £n = 2 (5)", and this is the formula for an 
exponential decay function, with n and x, in place of the usual variables 
x and y, respectively. (Comparing this equation with the general form of 
an exponential decay function, from above, we have a = 2 and b = 2.) 


3 Graphs and long-term behaviour 


However, the graph of the geometric sequence consists only of those 
isolated points on the exponential decay curve that have first coordinate 
n =1,2,3,..., rather than the whole curve. The graph and the 
exponential decay curve on which it lies are shown in Figure 8. 


TrA 


Figure 8 The graph of a geometric sequence lying on the graph of an 
exponential decay function 


As another example, consider the sequence (yn) from Activity 16(a). It has 
the closed form yn = 4.2 x 1.7"~!, which can also be written as 

Yn = (4.2/1.7) x 1.7”; that is, Yn = = x 1.7". So the points of this graph 
lie on the graph of the exponential growth function f(x) = ab? with a = # 


17 
and b = 1.7. 


In general, the geometric sequence with first term a and common ratio r 
has the closed form 


tn =ar"! (n=1,2,3,...), 


which can be rearranged as 
In = (2) P= (WEA a, 
T 


provided that r #0. If r > 0 and r Æ 1, then the graph of this geometric 
sequence consists of points that lie on the graph of an exponential growth 
or decay function. 


Plotting graphs of sequences with a computer 


When you plot the graphs of sequences by hand, you can usually plot only 
a small number of points. In the next activity you can find out how to use 
a computer to plot graphs of sequences showing many points. This work 
will prepare you for studying the long-term behaviour of sequences in the 
next subsection. 


Activity 17 Plotting graphs of sequences with a computer HAA 


Work through Subsection 11.1 of the Computer algebra guide. 
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Ln k 


3.2 Long-term behaviour of sequences 


A graph of a sequence can show information about the sequence only for a 
limited number of terms. We now investigate what can be said about the 
long-term behaviour of infinite sequences, that is, how each sequence 
will develop as more and more terms are considered. To start with, here’s 
some terminology that’s useful for describing the long-term behaviour of 
sequences. 


Terminology for long-term behaviour 


First, we use the words increasing and decreasing for sequences in much 
the same way as was introduced for functions in Subsection 1.4 of Unit 3. 
A sequence (£n) is increasing if £n—1 < £n for each pair of sucessive terms 
In—1 and zn, and decreasing if 7,_; > £n for each pair of successive 
terms 2,_1 and zn. The graphs in Figure 9 illustrate how this terminology 
applies. The sequences in (e) and (f) are neither increasing nor decreasing. 


Next, suppose that all the terms of a sequence (xn) lie within some 
interval [—A, A], where A is a fixed positive number, as illustrated in 
Figure 9(e), for example. Then we say that the sequence (£n) is bounded. 
If there is no fixed value of A, however large, for which all the terms of the 
sequence (£n) lie within the interval [—A, A], then we say that the 
sequence (£n) is unbounded, and also that the terms of the sequence 
become arbitrarily large. Again, the graphs in Figure 9 illustrate how 
this terminology applies. 
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Figure 9 Sequences with various long-term behaviours 
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If the terms of a sequence (£n) approach 0 more and more closely, in such 
a way that they eventually lie within any interval [—h, h], no matter how 


3 Graphs and long-term behaviour 


small the positive number h is taken to be, then we say that the terms of 
the sequence (zn) become arbitrarily small. More formally, we say that 
£n tends to 0 as n tends to infinity, and we write 


Ln > 0 as n > oo. (2) 


Three sequences with this property are illustrated in Figure 10. 


Ln A Ln A Tn A 
e e 
e 
har hF e ` hi e š 
E R E 4 r’ 
h+ © h+ h+ e 
e ° 
e 
(a) (b) (c) 
Figure 10 Sequences that tend to 0 
More generally, suppose that the terms of a sequence (£n) approach a 
particular number L more and more closely, so they eventually lie within 
any interval [L — h, L + h], no matter how small the positive number h is 
taken to be. Then we say that £n tends to L as n tends to infinity, and 
we write 
In + L as n > oo. (3) 
Three sequences with this property are illustrated in Figure 11. 
Tn A Tna © Tna © 
e 
° e 
e ° 
L4 sosooe L+ cece. L+ ete 
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Figure 11 Sequences that tend to L 


We also say in such a case that the limit of the sequence (xn) is L, and 
that the sequence (£n) converges or is convergent to the limit L. An 
alternative way to write statement (3) is 

lim zy, = L, 

N+ 0o 
which is read as ‘the limit as n tends to infinity of x-sub-n is L’. In 
particular, statement (2) above can be written as 


lim zy, = 0. 
noo 
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You saw a similar notation for other types of limits in Subsection 1.4 of 
Unit 6, and in Subsection 1.2 of Unit 8. 


We also use arrows to denote ‘tends to’ in some cases where the sequence 
is unbounded. If a sequence (£n) has the property that, whatever positive 
number A you take, no matter how large, the terms of (£n) eventually lie 
in the interval [A, oo), then we say that £n tends to infinity as n tends 
to infinity, and we write 


Ln > CO as n > CO. 


Similarly, if a sequence (xn) has the property that, whatever positive 
number A you take, the terms of (£n) eventually lie in the interval 

(—oo, —A], then we say that x, tends to minus infinity as n tends to 
infinity, and we write 


Ln > — aS n —> OO. 


These definitions are illustrated in Figure 12. 
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Figure 12 Sequences that tend to infinity or minus infinity 


3 Graphs and long-term behaviour 


Activity 18 Using the notation for long-term behaviour 


Assuming that the patterns of behaviour suggested by the graphs in 
Figure 9 continue unchanged as n increases, complete the statement 


In >... aS n — OO, 


for each of the sequences (£n) shown in Figure 9(b), (c) and (e). 


Long-term behaviour of arithmetic sequences 


You’ve seen that the closed form of an arithmetic sequence (£n) with first 
term a and common difference d is 


In =at+(n—1)d (n=1,2,3,...). 
This closed form can be rearranged as 

Zn = (a—d)+nd (n=—1,2,3,..+), 
which is the same as 

In =b+nd (n= 1,2,3,...), 


where b = a — d. This last formula is a little simpler than the first one 
since it involves n rather than n — 1. So, when studying the long-term 
behaviour of arithmetic sequences, we’ll consider sequences with the closed 
form £n = b + nd, where b and d are constants, with d Æ 0, and where the 
range of values of n is 1,2,3,.... These are arithmetic sequences with first 
term b + d and common difference d. 
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Since the points that form the graph of an arithmetic sequence lie on a 
straight line, the long-term behaviour of arithmetic sequences is 
straightforward to describe. 


Long-term behaviour of arithmetic sequences 


Suppose that (£n) is an arithmetic sequence with common 
difference d. 


e Ifd>0, then (zn) is increasing and £n > 00 as n > oo. 
e Ifd<0, then (zn) is decreasing and £n + —00 as n > oo. 


e Ifd=0, then (£n) is constant. 


The three cases are illustrated in Figure 13. 


Tn A cn Cee Tn A 
Pe a =e lenin le, i oe 
. °’ PETT 
oe e eoeevoeaev0e0e0e0e 
e o 
(J 
i SS a ae >=) A a 
123 A56 rygg e W23 A5 (678-9) 
(a) d>0 (b)d<0 fold) 


Figure 13 Graphs of arithmetic sequences 


Figure 14 The graph of the 
sequence in Example 11 


The graph of the sequence in Example 11 is shown in Figure 14. 
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Activity 19 Finding the long-term behaviour of an arithmetic 
sequence 


Describe the long-term behaviour of the sequence (zn) given by 


im =3- $n (n=1,2,3,...). 


Long-term behaviour of geometric sequences 


You’ve seen that the closed form of a geometric sequence (£n) with first 
term a and common ratio r is 


En =r! (n=1,2,3,...). 
This closed form can be rearranged as 
n= (=) r™ (n=1,2,3,...), 

which is the same as 
nS” (n= l2 3e); 


where c = a/r. This last formula is a little simpler than the original 
formula since it involves n rather than n — 1. So, when studying the 
long-term behaviour of geometric sequences, we’ll consider sequences with 
closed form cr”, where c and r are constants, with r Æ 0, and where the 
range of values of n is 1,2,3,.... These are geometric sequences with first 
term cr and common ratio r. 


The long-term behaviour of geometric sequences is much more varied than 
that of arithmetic sequences. However, you can determine the long-term 
behaviour of any particular geometric sequence if you know the long-term 
behaviour of sequences of the form (r”), for all the different possible values 
of r. So let’s start by looking at that. (The notation (r”) means the 
sequence (£n), where £n = r” for n = 1,2,3,..., as you’d expect.) 


First, there are three special cases. 
If r = 0, then (r”) is the sequence 0,0,0,0,.... 
If r = 1, then (r”) is the sequence 1,1,1,1,.... 
If r = —1, then (r”) is the sequence — 1,1,—1,1,.... 
So the sequences for r = 0 and r = 1 are constant sequences, and the 


sequence for r = —1 alternates indefinitely between —1 and 1. 


Let’s now look at the cases where r is positive, but not equal to 1. You can 
see what happens in these cases by using facts about exponential functions 
that you saw in Subsection 4.1 of Unit 3. Recall that the graph of the 
exponential function f(x) =b”, where b is positive but not equal to 1, is 


e increasing if b > 1, becoming steeper as x increases; 
e decreasing if 0 < b < 1, becoming less steep as x increases. 


These two cases are shown in Figure 15. 
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y 
xY 


(a)b>1 (b)O0<b<1 


Figure 15 Graphs of y = b” 


Remember also from Unit 3 that the x-axis is an asymptote of the graph of 
the exponential function f(x) = b”. When 0 < b < 1, this means that the 
graph gets closer and closer to the positive x-axis as x increases, as 
illustrated in Figure 15(b), so the value of b” approaches 0 more and more 
closely as x increases. 


The only difference between the sequence (r°) and the exponential 
function y = r” is that the domain of (r”) consists of all the natural 
numbers, whereas the domain of y = r” consists of all the real numbers. 
Hence the graph of the sequence (r”) is made up of isolated points, all 
lying on the graph of the function y = r”, which is a continuous curve. So 
the long-term behaviour of the sequence (r°), in the cases where r is 
positive but not equal to 1, is as follows. 


e Ifr>1, then (r”) is increasing and r” > œ as n > oo. 
e If0<r<1, then (r”) is decreasing and r” + 0 as n > co. 


These two cases are illustrated in Figure 16. 
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Figure 16 Graphs of sequences of the form (r”) 

For example, 

e the sequence (2”) is increasing and 2” — oo as n + co 

e the sequence (0.5”) is decreasing and 0.5” — 0 as n > œo, 


as illustrated in Figure 17. 
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Figure 17 The graphs of the sequences (a) (2”) (b) (0.5”) 


Now let’s consider the long-term behaviour of sequences of the form (r”) 
where r is negative but not equal to —1. 


For example, consider the sequence ((—2)"). The nth term of this 
sequence, (—2)”, can also be written as (—1)” x 2”, so the terms of the 
sequence have the same magnitude as the terms of the sequence (2"), but 
alternate in sign, as illustrated in Figure 18(a). Hence the sequence 
((—2)”) is neither increasing nor decreasing, but is unbounded. 


Similarly, the terms of the sequence ((—0.5)") have the same magnitude as 
the terms of the sequence (0.5), but alternate in sign, as illustrated in 
Figure 18(b). So the sequence ((—0.5)") is neither increasing nor 
decreasing, but (—0.5)" > 0 as n > co. 
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Figure 18 The graphs of the sequences (a) ((—2)”) (b) ((—0.5)”) 

In general, the long-term behaviour of the sequence (r”), where r is 

negative but not equal to —1, is as follows. 

e Ifr<-—1l, then r” alternates between positive and negative values, 
and (r”) is unbounded. 


e If—l1<r<0, then r” alternates between positive and negative 
values, and r” > 0 as n > œ. 
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Here’s a summary of the facts that you’ve seen about the behaviour of the 
sequence (r°), for all possible values of r. All the cases except r = 0 are 
illustrated in Figure 19. 


Long-term behaviour of the sequence (r”) 


Value of r Behaviour of (r”) 
Ff => Il Increasing, r” — oo as n —> co 
p= Constant il, mR 
O<r<il Decreasing, r” — 0 as n > oo 
T=0 Constant: 0,0,0,... 
—l<7< Alternates in sign, r” > 0 as n > co 
r=-—1 Alternates between —1 and 1 
r<-—l1 Alternates in sign, unbounded 
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Figure 19 Sequences of the form (r”) for all non-zero values of r 
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Now remember that any geometric sequence has the form (cr"). You can 
work out the long-term behaviour of any sequence of this form by thinking 
about the long-term behaviour of the sequence (r”) and using the fact that 
the terms of (cr”) are obtained by multiplying the terms of (r”) by c. 


For example, since the sequence (2”) is increasing and tends to infinity, as 
illustrated in Figure 20(a), it follows that the sequence ( x 2”) is also 
increasing and tends to infinity, as illustrated in Figure 20(b). It also 
follows that the sequence (— x 2) is decreasing and tends to minus 
infinity, as illustrated in Figure 20(c). 
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Figure 20 The sequences (a) (2”) (b) (2 x 2”) (c) (—3 x 2”) 


When you’re working out conclusions like these, it’s helpful to use the 
ideas about scalings of graphs that you met in Unit 3. For example, it 
follows from what you saw there that the graph of the sequence (5 x 2”) is 
obtained from the graph of the sequence (2”) by scaling it vertically by a 
factor of 2. This squashes the graph vertically. Similarly, the graph of the 
sequence (—$ x 2”) is obtained from the graph of the sequence (2”) by 
scaling it vertically by a factor of —3. This squashes the graph vertically, 
and reflects it in the horizontal axis. 


The following box summarises some useful facts about multiplying the 
terms of a general sequence by a constant. However, when you need to use 
these facts you may find it easier just to think about the graph of the 
sequence, as in the paragraph above, rather than apply the facts directly. 
Try to check the facts in the box by thinking about the graphs of 
sequences in this way. 
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Multiplying each term of a sequence by a constant 
Suppose that (xn) is an infinite sequence and c is a constant. 


is constant 
alternates in sign 
Ifc #0 and (z,) ¢ is bounded , then so is/does (ca,,). 
is unbounded 
tends to 0 


is increasing 
is decreasing 
tends to oo 

tends to —oo 


lie > 0 and ep) , then so is/does (cxn). 


is increasing is decreasing 

is decreasing is increasing 
Bes O aral a) tends to oo BOO EE) tends to —oo 

tends to —co tends to oo 


You might find the tutorial clip for the example below particularly helpful. 


Cc) Example 12 Finding the long-term behaviour of geometric 
—= sequences 


Describe the long-term behaviour of each of the sequences given by 
the following closed forms. 


(a) 2, =—20 <0. G—=1,2.3)..) 
(Dia oe m=] 
le) za =AL. = 1,253...) 
Solution 


@. Use the facts in the box on page 44 and in the box above, and 

think about the graphs of the sequences involved. © 

(a) Since 0 < 0.7 < 1, the sequence (0.7”) is decreasing and 0.7” — 0 
as n — oo. 
To obtain (£n) we multiply each term by the negative constant 
—20. Hence (zn) is increasing and £n —> 0 as n > oo. 

(b) Since 1.5 > 1, the sequence (1.5") is increasing and 1.5” — oo as 
n— oOo. 


To obtain (yn) we multiply each term by the positive constant E, 
Hence (yn) is increasing and yn — œ as n > oo. 
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(c) Since —1.1 < —1, the sequence ((—1.1)”) alternates in sign and is 
unbounded. 


To obtain (zn) we multiply each term by the non-zero constant 2. 
Hence (zn) also alternates in sign and is unbounded. 


The graphs of the sequences in Example 12 are shown in Figure 21. You 
can see that the long-term behaviour of the sequences appears to be as 
determined in the solution to Example 12. 


Ena Yn 

ileal = e 

123456988" 10 R 
254 e 

° 5 ° 
o. 

Sis : | ẹe s9? ek 

. 123456789" 


Figure 21 The graphs of the sequences in Example 12 


Activity 20 Finding the long-term behaviour of geometric sequences 


Describe the long-term behaviour of each of the sequences given by the 
following closed forms. 

(a) £n =—4 x 1.2” (n=1,2,3,...) 

(b) Yn = 5(—0.9)” (n= 1,2,3,...) 

(c) zn = -5(725) (n=1,2,3,...) 

You can check your answers to this question by using the computer algebra 
system to plot the graphs of the sequences. 


Long-term behaviour of further sequences 


You can find the long-term behaviour of slightly more complicated 
sequences by using the following facts. 


Suppose that a, c and L are constants. 

e If £n —> Lasn — œ, then £n +a — L+a asn —> oœ. 
e If £n > œ as n —> œ, then £n +a > œ as n > oo. 

e If £n ~ —o as n > œ, then tn +a > —œ as n > œ. 


e Ifa, —> Lasn —> œ, then cr, > cL as n > œ. 
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To understand why these facts hold, you can again use the ideas about 
scalings and translations of graphs that you met in Unit 3. For example, 
since the sequence (0.5") has limit 0, as illustrated in Figure 22(a), it 
follows that the sequence (0.5” + 0.1) has limit 0.1, as illustrated in 
Figure 22(b). 
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Figure 22 The sequences (a) (0.5") (b) (0.5” + 0.1) 


The following example shows how you can use the facts above, together 
with the facts that you met earlier in this subsection. 


| CS Example 13 Finding the long-term behaviour of more sequences 


= Describe the long-term behaviour of each of the sequences with the 
following closed forms. 


(a) ac, = — 30% 0.9" 80 (v= 1,253...) 

(DG — Oe gr ee) 

Solution 

@. Deal first with the term that involves r”. © 


(a) Since 0 < 0.9 < 1, the sequence (0.9") is decreasing and has 
limit 0. Hence the sequence (—30 x 0.9”) is increasing and also 
has limit 0. Adding 80 to each term gives another increasing 
sequence, with limit 80. Hence the sequence (£n) is increasing 
and £n — 80 as n > oo. 


Since 2 > 1, the sequence (2”) is increasing, and 2” — oo as 

n — oo. Since the constant 5 is positive, the sequence (5 x 2”) is 
also increasing and 5 x 2” — oo as n —> oo. Subtracting 7 from 
each term gives another increasing sequence that tends to infinity. 
Hence the sequence (yn) is increasing and yn —> oo as n — oo. 


= 
< 


The graphs of the sequences in Example 13 are shown in Figure 23. 
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Figure 23 The graphs of the sequences in Example 13 


Activity 21 Finding the long-term behaviour of more sequences 


Describe the long-term behaviour of each of the sequences with the 
following closed forms. (These sequences are closely related to the 
sequences in Activity 20.) 


(a) @g=17- 2X12" (n=1,2,3,...) 
(b) bn =5(—0.9)" +45 (n=1,2,3,...) 


You can check your answers to this question by using the computer algebra 
system to plot the graphs of the sequences. 


4 Series 


In this section you'll investigate the sums of terms of sequences, and meet 
a useful notation for such sums. 


4.1 Summing finite series 


It’s sometimes useful or interesting to add up consecutive terms of a 
sequence. For example, consider the following sums: 


Oi 0? oj. OF 4. OF 5. 0P 
(3 x 4) + (3 x 5) + (3 x 6) +--+ (3 x 20). 


The first expression here is the sum of the first five terms of the geometric 
sequence (2”), and the second expression is the sum of the terms, from the 
fourth to the twentieth, of the arithmetic sequence (3n). Expressions like 
these are called series. That is, a series is an expression obtained by 
adding consecutive terms of a sequence. (The singular and plural forms of 
the word ‘series’ are the same.) 


The series above are finite series, since they have only a finite number of 
terms. By contrast, an infinite series has an infinite number of terms. 
We’ll consider finite series in this subsection and infinite series in the next. 


4 Series 
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Series are important in calculus, as you'll see in the next unit, and are also 
used frequently in statistics. They can be of interest too in their own right. 
For example, look at the series below, whose terms come from the sequence 
of odd integers: 


1=1 
1+3=4 
1+3+5=9 


1+3+5+7=16. 


An obvious question is whether this pattern of adding up consecutive odd 
integers to obtain square numbers continues. You’ll be able to answer that 
question later in the subsection. 


The number that you obtain when you add up all the terms of a series is 
called the sum of the series, and the process of finding this sum is called 
summing the series, or evaluating the series. For example, the sum of 
the series 1+3+5+7 is 16. You may think that this terminology is a 
little strange, because a series is already a sum, but it’s standard 
terminology, and is convenient in practice. 


In the rest of this subsection we’ll look at summing some particular types 
of finite series. 


Finite arithmetic series 


Let’s start by looking at sums of finite arithmetic series. As you’d expect, 
an arithmetic series is one whose terms come from an arithmetic 
sequence. Here’s an example: 


5+84+114+ 14+17+ 204 23 + 26 + 29 + 32 +35 + 38 + 41 + 44. 


The terms of this series are the terms of an arithmetic sequence with first 
term 5 and common difference 3. One way to find the sum of this series is 
simply to add up all 14 terms. However, there’s another and more 
illuminating way of finding the sum. First reverse the order of the terms of 
the series, and then write the result under the original series, with the 
terms aligned: 


5 +84+114+144+ 17+ 204 23 + 26 + 29 + 324 35+ 38+ 41 + 44 
44+ 41+ 384 35 + 32 + 29 + 264 23 +20 +17 +14+11 +8 +5. 


Then add the two copies of the series together, starting by adding each 
term to the one below. This gives 
49 + 49 + 49 + 49 + 49 + 49 + 49 + 49 + 49 + 49 + 49 + 49 + 49 + 49 
= 14 x 49 = 686. 


This number is twice the sum of the original series (since we added two 
copies of the series together). So the sum of the original series is 


5 x 686 = 343. 


The reason for all the 49s above is that each term in the original series is 3 
more than the one before, whereas each term in the reverse series is 3 less 
than the one before. When you add each term to the one below, these 


increases and decreases cancel out, and you obtain a new series all of whose 
terms are equal to its first term. The first term of this new series is the 
sum of the first and last terms of the original series, that is, 5 + 44 = 49. 


You can see that you could use the same approach to find the sum of any 
finite arithmetic series. So, in general, the sum of any finite arithmetic 
series is 


5 x (number of terms) x (first term + last term). (4) 
If you don’t know the number of terms then you can work it out by using 
the equation 


last term — first term 
number of terms = - +1, 
common difference 


which you saw on page 19. 


Activity 22 Finding the sum of an arithmetic series 


Find the sum of the arithmetic series 


6+ 13+ 20+---+ 90. 


As you saw in Subsection 2.2, the nth term of the arithmetic sequence (£n) 
with first term a and common difference d is given by 


te = ah (a— jd. 
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It follows from this formula and expression (4) that the sum of an 
arithmetic series with n terms is 
5 x (number of terms) x (first term + last term) 
= 5n(r1 + Xn) 
1 
2 
1 


= jna +a + (n — 1)d) 
= zn (2a + (n — 1)d). 


This useful formula is summarised in the box below. 


Sum of a finite arithmetic series 


The arithmetic series with first term a, common difference d and 
n terms has sum 


1 x (number of terms) x (first term + last term); 
that is, 


in (2a + (n — 1)d). o 


Activity 23 Finding the sums of more arithmetic series 


Use formula (5) to find the sums of the following arithmetic series. 
(a) 14+24+3+---+100 

(b) 12+15+18+---+60 

(c) 1+345+---+19 


As you'll see shortly, the result of Activity 23(a) can be generalised to a 
useful formula for the sum of the first n natural numbers. Also, you can 
generalise the result of Activity 23(c) to prove that adding up consecutive 
odd numbers, starting at 1, always gives a square number — see the 
solution to the activity. 


Finite geometric series 


Now let’s look at sums of geometric series: a geometric series is a series 
whose terms come from a geometric sequence. Again there’s a particular 
approach that can be used to evaluate the sum of such a series. We’ll 
apply it to find a general formula for the sum of a finite geometric series, 
after first considering a particular example. 


Suppose that you want to evaluate the sum of the finite geometric series 
Legs FDP to ee ae oa 


One way to find this sum is simply to evaluate all of the 12 terms and 
then add them together. But there is a quicker and more illuminating way 
of finding this sum. Suppose that the sum is s. First write down the 


expressions for s and for 2s on successive lines (here 2s was chosen 
because 2 is the common ratio of the series): 


s= 1-0 4 ore eee to pot et toe 
25 = ao 0 a a a Oy a oe a OI oN a 


The terms in the sums s and 2s have been aligned to emphasise that 
they’re the same, except for the appearance of 1 in the expression for s, 
and 2!” in the expression for 2s. Now subtract the first of these equations 
from the second. All the common terms cancel on the right-hand side, so 


s=2' — 1 = 4096 — 1 = 4095. 


You can use the same approach to find a general formula for the sum of a 
finite geometric series. 


Consider the finite geometric series with first term a, common ratio r 
and n terms, and suppose that its sum is s; that is, 


s=a+ar+ar? +ar? +- + art. (6) 
Then multiply the series by the common ratio r, and write it down below 


the original series, with like terms aligned. This gives 


s=atar+ar*+are+---tar™}, 


rs= ar +ar* +ar eed gph) + ar”. 


If you now subtract the bottom series from the top series, then most of the 
terms cancel each other out, and you obtain 


$s—rs=a-—ar”. 
Taking out the common factor on each side gives 
(l-r)s=a(1—-r”). 
So, if r #1, then 
omn 
s=, 
If r = 1, then it follows directly from equation (6) that 


S=at+a+t+:::+4a4=n4. 
am 


n terms 


Sum of a finite geometric series 


The geometric series with first term a, common ratio r 4 1 and 
n terms has sum 
a(l — r”) 
aa o 
=r 


As you saw earlier, the common ratio of a geometric sequence can be 
positive or negative, and when it’s negative the terms of the sequence 
alternate in sign. Expression (7) still applies when the common ratio is 
negative. 
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Figure 24 A geometric 
interpretation of 
14+24+34+4+5=5x5x6 


Carl Friedrich Gauss 
(1777-1855) 
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Activity 24 Finding the sums of geometric series 


(a) Find the sum of the first 10 powers of 3, that is, 
3+3? +3 4+---437. 
(b) Find the value of the sum 
2 3 8 
La FU (a) Rra 


giving your answer correct to six decimal places. 


Other finite series 


We now briefly consider some formulas for the sums of other standard 
finite series. 


First, consider the sum of the first n natural numbers: 
1+243+---4n. 


This is a finite arithmetic series with first term 1, last term n and n terms, 
so its sum is 


5 x (number of terms) x (first term + last term) = $n(1 + n). 


Sum of the first n natural numbers 


14+24+3+4---+n= 5n(n+1) 


This result has a geometric interpretation, which is illustrated (for the case 
n = 5) in Figure 24. The number of shaded dots, which is 1+2+3+4+5, 
is equal to half of the total number of dots, which is - x5x 6. 


It is said that the great German mathematician Carl Friedrich Gauss, 
at the age of ten, was asked along with the rest of his school class to 
find the sum 1 +2 +3 +---+100. The teacher intended this to be a 
lengthy task. However, Gauss came up with the correct answer 
almost immediately, by applying the approach used earlier to derive 
expression (4) on page 51. 


There are also formulas for the sums of the first n square numbers and the 
first n cube numbers, as stated in the following box. The proofs of these 
formulas are beyond the scope of this module, but you'll see the proofs if 
you go on to study MST125 Essential mathematics 2. 


Sum of the first n square or cube numbers 
1? +2? +3? +---+n? = gn(n + 1)(2n + 1) 
1 +2 +3%4---+n3 = in?(n+1)? 


Activity 25 Using the formulas for the sums of consecutive natural 
numbers, squares and cubes 

(a) Use the formulas above to find the sums of the following finite series. 
(i) 14+2+3+---+30 

(ii) 1? +274 3?+---+10? 

(ii) 1? +2? +3? 4.--- 430? 

(iv) 13 +23 +33 +... +303 

( 


(b) (i) Use your answers to parts (a)(ii) and (a)(iii) to find the sum of 
the series 
11? + 127 + 13? + --- +307. 


(ii) Use a similar method to find the sum of the series 


503 as 2? 4 99 eee, 


4.2 Summing infinite series 


In this subsection we’ll look at infinite series, which are expressions such 
as 
aj +ag+agt+::-, 


where (an) is a sequence. So that you can see what it means to add up all 
the terms in an infinite series, let’s start by considering the infinite series 


B+) + (+ GY 


This is an infinite geometric series — its terms are those of the infinite 


geometric sequence with first term i and common ratio Z. Consider what 


happens as you add up more and more of the terms of this series: 


j=} 
+G =i+i=i 

GtG =h4h+b=3 
OPG +O itiiti 

OPG +Q)+ QP =+ tth 
TORIO ORAO E EET 
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As you add on more and more terms, the totals get closer and closer to 1. 
In fact, they get arbitrarily close to 1, but never reach 1, because each time 
you add on a new term, the difference between the total and 1 halves. You 
can think of this as meaning that, if you add on infinitely many terms, 
then the total is exactly 1. This is illustrated in Figure 25. So in this sense 
the sum of the infinite series above is 1. 


2 CG) =a) 
ic a a a vo n 
| l | | | | 
a 1 1 


Figure 25 The sum of the infinite series i + ey + ee +--+ isl 


tf | eat half today, One of Zeno’s paradoxes 
thew half of what's Left 
tomorrow, and so on, 


The result illustrated in Figure 25 is related to one of the famous 

er paradoxes attributed to the Greek mathematician Zeno of Elea 

will ít Last forever? (c. 490-430 Bc). It is known as the dichotomy paradoz, and also as 
the race course paradoz. One statement of it is based on the following: 


That which is in locomotion must arrive at the half-way stage 
before it arrives at the goal. 


(Aristotle, Physics VI:9, trans. R.P. Hardie and R.K. Gaye (2009), 
New York, Digireads.com.) 


Before covering a fixed distance, half the distance must be covered, 
but before that, a quarter of the distance must be covered. Before 
that, an eighth must be covered, and so on. As this involves an 
infinite number of tasks, it is impossible to achieve. 


Many infinite series don’t have sums. For example, consider the infinite 
geometric series with first term 1 and common ratio 2: 


142427423 4.... 


Consider what happens as you add up more and more of the terms of this 


series: 
1=1 
142=3 


14+2+2?=1+4+2+4=7 
142427429 =142444+8=15 
1+2 +2 +2 +24=1+2+4+8+16= 31 
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With this series, as you add on more and more terms, the total just keeps 
getting larger and larger, without getting closer and closer to any 
particular number. So this infinite series doesn’t have a sum. 


The ideas illustrated above can be applied to any infinite series, to 
determine whether it has a sum, and to find the value of its sum if it has 
one. We make the following definitions. 


Consider any infinite series 
aj +ag+agz+r::. 
Let 
$1 = 1, 
S2 = Q1 + Q2, 


S3 =a, + a2 + Q3, 


S4 =a, + a2 + a3 + Q4, 


and so on. The numbers s1, s2, 53,... are called the partial sums of the 
series, and the infinite sequence (sn) that they form is called the sequence 
of partial sums of the series. 


If the sequence of partial sums of an infinite series converges to a limit, 
say s, then we call s the sum of the infinite series. On the other hand, if 
the sequence of partial sums doesn’t converge, then the infinite series 
doesn’t have a sum. 


For example, you saw at the beginning of this subsection that the infinite 
series 


HOE H 
has the sequence of partial sums 


1 3 7 15 31 
2? 4) 8? I6 32 t: 


Since this sequence converges to 1, the infinite series has sum 1, as you saw. 
Similarly, you saw that the infinite series 
1+2+2 +2 +- 
has the sequence of partial sums 
1, 3, 7, 15, 31l, .... 
Since this sequence doesn’t converge, the infinite series has no sum. 


It’s often quite difficult to determine whether the sequence of partial sums 
of a particular infinite series converges, and if it does converge, what the 
limit is. However, it’s fairly straightforward to deal with infinite arithmetic 
and geometric series. We’ll consider each of these two types of infinite 
series in turn, starting with geometric series. 
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Infinite geometric series 
Every infinite geometric series has the form 
atar+ar?+ar?+--- , 


where a is the first term and r is the common ratio. A useful way to 
determine whether such a series has a sum is to use the fact that each of 
its partial sums is the sum of a finite geometric series, and to apply the 
formula that you met in the previous subsection for such a sum. 


To illustrate this approach, consider once more the infinite geometric series 


with first term - and common ratio i: 


TOROA OS 


Its partial sums are 


s=4 

a1 1)2 
s=3+(3 
so $4 (4) + (4)" 


In general, the nth term of its sequence of partial sums is 
_1 1)2 lyr 
megt tea 


This is the sum of a finite geometric series with first term a = 4, common 


9) 
ratio r = 5 and n terms, so by formula (7) on page 53, we have 


ph, Pai) 27-2)" n 
e Eo 0) 20-99 yy 


Since (3)” + 0 as n > œ, it follows that 


8, =1—(4)" 31-0=1 as n — OOo. 


So the sequence of partial sums of this series converges to 1, and hence the 
series has sum 1. This confirms the result found earlier. 


You can obtain a general result about the sums of infinite geometric series 
by applying the same approach. Consider the infinite geometric series with 
first term a Æ 0 and common ratio r: 

atartar?+are+---. 
Its partial sums are 

Ss, =a 

S23 =a+ar 

83 =atar+ ar? 


s4=atar+ar? +ar? 


The nth term of its sequence of partial sums is 
Sn =atartar?+---+ar™}, 
This is the sum of a finite geometric series with first term a, common 
ratio r and n terms. So by formula (7) on page 53, if r Æ 1, then 
a(l — r”) 
Sn = a ee 
If -1<r<1, then r” > 0 as n > œo, as you saw on page 44, and hence 
a(l—r” 1-0 
= (=r?) e a( ) __4 
l-r l-r l-r 


So if —1 < r < 1, then the series has the sum a/(1 — r). 


as n — oo. 


If r< -—1 orr > 1, then r” is unbounded, as you also saw on page 44, and 
it follows that sn = a(1 — r”)/(1 — r) is also unbounded. So if r < —1 or 
r > 1, then the series doesn’t have a sum. 


The only other possible values of r are r = —1 and r = 1. When r = —1, 
the series is 


a-at+a-a+a-at:::. 


The sequence of partial sums of this series is a,0,a,0,a,..., which doesn’t 
converge (since a Æ 0), so the series doesn’t have a sum. When r = 1, the 
series is 


atat+ta+a+a+t+::-. 


The sequence of partial sums of this series is a, 2a, 3a,4a,..., which 
doesn’t converge (since a # 0), so again the series doesn’t have a sum. 


In summary, the following facts hold. 


Sum of an infinite geometric series 


The infinite geometric series with first term a 4 0 and common 
ratio r has 


sum —, hh —H=l<p< ik 
l= 


nosum, ifr<—lorr>l1. 


The only infinite geometric series with first term a = 0 is the series 
0+0+0+---, which has sum 0. 
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Here are some examples for you to try. 


Activity 26 Summing infinite geometric series 


For each of the following infinite geometric series, determine whether or 
not it has a sum, and find the value of the sum if it exists. 


O F424 D+ Qe 
(0) 1-44 E + 

(J ho as 
( 


a) $-4(2) +4@)-4@) + 


In fact, you’ve been dealing with the sums of infinite geometric series since 
Unit 1 of this module, because any recurring decimal can be thought of as 
the sum of a terminating decimal (possibly 0) plus an infinite geometric 
series. For example, 


0.333333... =3(a) +3(#)°+3(5)° +: , 
5.178 178 178178... = 5 +178 (m0) + 178 (0) +178 (ag) + 


It follows that one way to find a fraction equivalent to a given recurring 
decimal is to use the formula a/(1 — r) for the sum of a geometric series. 


However, there’s a neater method, which is equivalent to using this 
formula. It’s illustrated in the next example. 


To find a fraction equivalent to a recurring decimal that has one or more 
non-zero digits before the recurring part, you can apply the method of 
Example 16 to find a fraction equivalent to the recurring part, and then 
add this to the number formed by the other digits. For example, by the 
solution to Example 16, 


1.123 123123... =1+ Te 
f a 333 333 


Activity 27 Finding fractions equivalent to recurring decimals 


Find a fraction equivalent to each of the following numbers. 
(a) 0.454545... (b) 3.729 729 729... 


We’ll now look at what happens when you try to sum an infinite 
arithmetic series. 


Infinite arithmetic series 
Every infinite arithmetic series has the form 
a+(a+d)+(a+2d)+-+, 


where a is the first term and d is the common difference. 


4 Series 


61 


Unit 10 Sequences and series 


62 


The situation for infinite arithmetic series is much simpler than for infinite 
geometric series. In summary, the only infinite arithmetic series that has a 
sum is the infinite series with first term 0 and common difference 0, that is, 
the series 0+0+0-+---, which has sum 0. You can see this by using an 
approach similar to the one used earlier for infinite geometric series. You 
use the fact that each partial sum of an infinite arithmetic series is the sum 
of a finite arithmetic series, and apply the formula for the sum of a finite 
arithmetic series that you met in Subsection 4.1, as follows. 


Consider the infinite arithmetic series with first term a and common 
difference d: 


at+(a+d)+(a+2d)+---. 
The nth term of its sequence of partial sums is 
Sn =a+(a+d)+(a+4 2d)+---+(a+(n—1)d). 


This is the sum of a finite arithmetic series with first term a, common 
difference d and n terms. So, by formula (5) on page 52, we have 


Sn = 4n(2a + (n — 1)d). 


If d #0, then the expression on the right-hand side of this equation is a 
quadratic expression in n, so Sn tends to infinity or minus infinity. If d = 0, 
then sn = na, so again sn tends to infinity or minus infinity, as long as 

a #0. Therefore, provided that a and d are not both zero, the series 
doesn’t have a sum. 


4.3 Sigma notation 


There’s a useful notation for writing series concisely. It can be used for 
both finite and infinite series, but we’ll begin by looking at how it’s used 
for finite series. 


For any sequence (£n), the sum 
Lp + Lp+1 +++ + Tq 


(that is, the sum of the terms from the pth term to the qth term) is 
denoted by 


oo (8) 


(This is read as ‘the sum from n equals p to q of x-sub-n.) This notation is 
called sigma notation or summation notation — the symbol ` is the 
upper-case Greek letter sigma. The numbers p and q, which tell you the 
terms to start and finish at, respectively, are called the lower and upper 
limits of the summation, respectively, and the variable n is called an 
index variable. For example, with this notation we can write 


5 
2427428424422 = 5° 2”, 
n=1 
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and 
20 
(3 x 4) + (3 x 5) + (3 x 6) +--+ + (3 x 20) = X` 3n. 
n=4 


The index variable n in sigma notation is a dummy variable — you can use 
any other variable name in its place. For example, 


5 5 5 
2 3 4 5 n jo k 
242742842442 =S r=) a= Soo. 
n=1 j=l k=1 


In this module we'll usually use n or k for the index variable in sigma 
notation. Sometimes for a finite series it’s natural to use n to denote the 
upper limit, and in that situation we’ll use k for the index variable. 


When sigma notation is in a line of text, it’s sometimes written with the 
limits to the right of, instead of below and above, the symbol $`. For 
example, expression (8) can be written as oS Tn- 


Example 17 Converting from and to sigma notation A 


(a) Write each of the following sums without sigma notation, giving — 
the first three terms and the last. 


G 19 ily 
OSa OFr wS a 
n=1 n=4 n=1 


(b) Write each of the following sums in sigma notation. 

(i) 3°44 459+.--.498 

@i) 142434-..4n 

(iii) The finite geometric series a + ar + ar? +---+ar"}. 
Solution 


(a) ®. In each case, put n equal in turn to each of the integers from 
the lower to the upper limit of the sum, and add the resulting 
terms. ® 


The sums are as follows. 
y 
(i) Son=14+24+3+---+7 
m= 
19 
(i) X n? =4 +57 +6? +- +19? 
n=4 
7 
(iii) S0(n +3) =44+54+6+---+20 


m= 
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(b) ®. In each case, find an expression for a typical term of the sum, 
then identify the lower and upper limits. © 


(i) The sum is 3° + 43 + 53 H- +99. 


@. The terms are of the form n?, going from n = 3 
ton=—9. # 
9 
It can be written as yy n>. 
n3 


(ii) The sum is 1+2+3+---+n. 


@. The sum already contains the variable name n, so we 
have to use a different letter for the index variable, say k. 
The terms are of the form k, going from k = 1 tok =n. & 


n 
It can be written as Se Ie. 
k=1 


(iii) The sum is a+ ar + ar? i Far l 


@. Again we have to use a letter other than n for the index 
variable, say k. The terms are of the form ar*—!, going from 
k=1tok=n. (Recall that ar® = a x 1 =a and that 
ar! = ar.) ® 

n 


It can be written as Se arn. 


k=1 
n-1 
This sum can also be written as ` ar". 
k=0 


Activity 28 Converting from and to sigma notation 


(a) Write each of the following sums without sigma notation, giving the 
first three terms and the last. 


20 19 6 
OSa Ant i) SO(Qn- 1) 
n=5 n=4 n=1 


(b) Write each of the following sums in sigma notation. 
OSa heed 150 
(ii) 57+674+77+--- +413? 
(iii) 24+27+23 +.. +2! 


You might have noticed that the series in Activity 28(a)(i) and (ii) are the 
same, even though they’re written differently in sigma notation. The same 
applies to the two alternative expressions given in the solution to 
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Example 17(b)(iii). In general, any series can be written in many different 
ways using sigma notation, even if you use the same index variable. 


The formulas that you met earlier for the sums of finite arithmetic and 
geometric series can be stated concisely using sigma notation, as in the box 
below. We use k for the index variable since the upper limit is n in each 
series. In fact we'll use k as the index variable in the rest of this 
subsection, since the upper limit is sometimes n. 


Sums of finite arithmetic and geometric series (in sigma 
notation) 


Similarly, the formulas for the sum of the first n natural numbers, the sum 
of the first n square numbers and the sum of the first n cube numbers can 
be stated concisely using sigma notation, as in the box below. The box 
also includes, at the beginning, the simple formula for adding up n copies 
of the number 1, which is sometimes useful when you’re working with 
series in sigma notation, as you’ll see shortly. 


Sums of standard finite series (in sigma notation) 


XOK = gn(n + 1)(2n+ 1) 


SO = gn? (nt 1)? 


Activity 29 Using formulas in sigma notation 


Use the formulas in the box above to find the sums of the following series. 


24 24 
(a) Dok (b) SUR 
k=1 k=1 
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Sigma notation allows you to work more easily with complicated finite 
series, and hence find their sums efficiently. To do this, you need to become 
familiar with a few rules for manipulating finite series in sigma notation. 
These are stated in the following box, and explained after the box. 


Rules for manipulating finite series in sigma notation 


q q 
> Gip = DD Tk 
kap KZP. 

q 


De Bea.) 


k=p 


5 Lk = 5 Tk — - Tk 
b= k=l k=l 


(where c is a constant) 


q q 


= 


R= 


(where 1 < p < q) 


The second rule in the box also holds if you replace the plus signs by 
minus signs: 


q 
S "(rk — yk) =D Lm 
k=p 


To see why these rules hold, you can translate them from sigma notation 
into the usual, longer notation for sums. When you do this, the first rule 
becomes 


Ctp Olgas H tety = Clip tipi es ey 


This is just the usual rule for multiplying out brackets. 
The second rule becomes 
(£p + Yp) + (Ep41 + Ypti) + °° 
= (tp + Upp +++ + aq) + (Yp + Yq) 


This rule holds simply because you can add numbers in any order. A 
similar argument shows that the version with minus signs holds. 
(Alternatively, you can deduce the version with minus signs by combining 
the first two rules in the box, taking c = —1.) 


(za + Yq) 
+ Yp+1 Fee 


Finally, the third rule becomes 


Lp + p1 +++ + Tq 


= (x1 + z2 +- + xq) — (£1 + £2 +++ + 2p-1) 


This rule says that if you split a series into two parts, then the sum of the 
second part is equal to the sum of the whole series minus the sum of the 
first part. You were asked to use this fact in Activity 25(b) on page 55. 


The next example illustrates how you can use the rules for manipulating 
finite series, together with some of the standard formulas for the sums of 
finite series, to find the sums of some other finite series. 


= 
Example 18 Using series manipulations to find the sums of finite A 
series a 
Find the sums of the following finite series. 


25 100 
(a) X (k? +2k) (b) X` (9k -4) 
k=l k=50 
Solution 


(a) @. Use the rules for manipulating series to express the series in 
terms of simpler series. ©& 


25 
X (k? + 2k) = D 
kil 
D” 
k=l k=l 


@. Use the formulas for the sums of standard series. ® 
= $(25)(25 + 1)(2 x 25 + 1) + 2 x $(25)(25 + 1) 
= % x 25 x 26 x 51 + 25 x 26 
= 5525 + 650 
=6175 


(b) &. The lower limit isn’t 1, so start by applying the third rule for 
manipulating series. ® 


100 100 49 
S (9k — 4) = X (9k — 4) -— X (9k — 4) 
k=50 k=1 k=l 


®. Now use the other rules to express each of the series on the 
right-hand side in terms of simpler series. ® 


Now 
100 100 100 
S (9k - 4) oe ae 
kil 


100 100 


=95 k Ay l 
k=l R= 


=9 x $x 100 x (100 + 1) — 4 x 100 
= 45 450 — 400 
= 45, 050. 
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Activity 30 Using series manipulations to find the sums of finite 
series 


Find the sums of the following finite series. 


30 40 125 
(a) X (2k? - k) (b) XO Gk? -1) (c) X (6K +7) 
k=1 k=1 k=65 


Sigma notation for infinite series 


To write an infinite series in sigma notation, you write the symbol oo in 
place of the upper limit. For example, 


HHOHO +=)" 


Activity 31 Using sigma notation for infinite series 
Write the following infinite series in sigma notation. 

(a) §+(5) + (3) + 

OE Gi ii te 

(c) TAG + (3) 


The formula that you met earlier (on page 59) for the sum of an infinite 
geometric series can be stated concisely using sigma notation, as follows. 


Sum of an infinite geometric series (in sigma notation) 


If—1 <r <1, then 


= a 

J ar”! = i . 
=i 

n=l 


The rules for manipulating finite series in sigma notation, given in the box 
on page 66, apply also for infinite series once the upper limit q is replaced 
everywhere by oo, provided that each series involved has a sum. Thus, for 
example, we have 


(()" + (3)") = 


n=1 


oO 


—— 
whe 
ge: 

z 


3 


+ 


e 

= 

+ 
iM 


ll 
nie = 


HH 
| 


w= 


Il 
= 
+ 
NI = nie 


NIV =e 


4.4 Summing series using a computer 


In the following activity you can learn how to use the module computer 
algebra system to work with series. 


Activity 32 Summing series on a computer 


Work through Subsection 11.2 of the Computer algebra guide. 


5 The binomial theorem 


In this final section you’ll meet an important result, called the binomial 
theorem, which will help you to multiply out expressions such as 


(l1+2)*, (a+b) and (2y—3)°. 
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5.1 Expanding powers of binomials 
In Subsection 2.3 of Unit 1, you saw how to expand squared brackets, such 
as 
(x-5), (a@+1)? and (2p— 3q}. (9) 
For example, 
(a — 5)? = (x — 5)(x — 5) 
= x? — 5g — 5x + 25 
= z? — 10x + 25. 
Each of expressions (9) is of the form 
(a +b)’, 
where a and b represent terms. For example, in the first expression a = x 
and b = —5, and in the third one a = 2p and b = —34q. 


You saw in Unit 1 that the following general formula holds for multiplying 
out squared brackets: 


(a+b)? =a? + 2ab +b. (10) 
So the square of the sum of two terms is equal to the square of the first 


term, plus twice the product of the two terms, plus the square of the 
second term. 


The next example reminds you how equation (10) is applied. 


In Unit 1 you also met the formula 

(a — b)? = a? — 2ab + b°, (11) 
but in fact formula (10) above is enough alone. For example, you saw in 
Example 19 that you can expand (2x — 1)? by using formula (10) with 


a = 2x and b = —1, as an alternative to using formula (11) with a = 2x 
and b = 1. 


Activity 33 Using the formula to expand squared brackets 


Use formula (10) to expand each of the following squared brackets. 
(a) (c+5)* = (b) (1-3) (ce) (p’ — @)? 


In Activity 33 you probably found that using formula (10) to expand the 
squared brackets is only a little quicker than just writing the expression as 
two pairs of brackets multiplied together and expanding them in the usual 
way. However, sometimes you have to expand cubed brackets, of the form 
(a + b)3, or similar brackets raised to an even higher power. It’s much 
quicker to use a formula to expand expressions like these. In this subsection 
you'll see a general formula for expanding any expression of the form 


(a +b)”, 


where n is a natural number and a and b are terms. This formula is known 
as the binomial theorem. A binomial is an expression that is the sum of 
two terms, so (a + b)” is a power of a binomial. 


The word ‘binomial’ is derived from a Latin word meaning ‘having 
two names’, and is related to the word ‘polynomial’. 


You’ve seen and used a formula for (a + b)?, so let’s now find a formula for 
(a +b)’. The easiest way to do this is to take the formula for (a + b)?, and 
multiply it by a+ b: 
(a +b)? = (a+ b)(a +b)? 

(a + b)(a? + 2ab + b?) 
= a(a? + 2ab + b°) + b(a? + 2ab + b°) 
a> +2a7b+ ab? 

+ a?b+ 2ab? +B? 
= a? + 3a°b + 3ab? + b°. 


Notice that in the working above, the expression after the fourth equals 
sign has been written over two lines, with the like terms aligned in 
columns. This makes it easy to collect the like terms. The working shows 
that the formula for (a + b)? is 


(a +b)? = a? + 3a7b + 3ab? + b’. (12) 


Activity 34 Using the formula to expand cubed brackets 


Use formula (12) to expand the following cubed brackets. 
(a) (+p)? (b) (1-22)? (c) (2a + 3y)? 
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So that you can begin to see what happens for higher powers of binomials, 
let’s work out one more formula, for (a + b)*. We take the formula for 
(a +b)? and multiply it by a + b: 
(a+b)* = (a+ b)(a +b)? 
= (a+ b) (a? + 3a7b + 3ab? + b?) 
= a(a? + 3a°b + 3ab? + b?) + b(a? + 3a7b + 3ab? + b°) 
= a + 3a°b + 30°b? + ab? 
+ a°b+3a7b? + 3ab? + bt 
= a" + 4a°b + 6a7b? + dab? + 07. 
So the formula for (a + b)* is 
(a+b)* = at + 4ab + 6a?b? + 4ab? + bt. 


You could use the same method to find formulas for (a + b)°, (a +b)® and 
so on: you just keep multiplying by a + b. The formulas that are obtained 
by doing this are shown below, along with the three formulas found above. 
The ‘formulas’ for (a + b)? and (a +b)! are also shown, as it’s helpful to 
consider these as part of the general pattern. Remember that any number 
raised to the power 0 is 1. The right-hand sides of the formulas have been 
aligned at their centres, to make it easier to see a particular pattern in 
them, which will be described shortly. 


(a +b)? = 1 

(a+b)! = a+b 

(a+b)? = a? + 2ab + b? 

(a+b)? = a? + 3a7b + 3ab? + b? 

(a+b)* = af + 4a%b + 6a?b? + 4ab? + b4 

(a+b)? = að + 5a*b + 10ab? + 10a7b? + 5abt + b5 
(a+b) = af +6a°b + 15a4b? + 20a%b? + 15a7b4 + Gab? + bê 


There are two useful things to observe about the formulas above. 


First, notice that in each formula on the right-hand side, the powers in 
each term add up to the power of the brackets on the left-hand side. For 
example, look at the formula for (a + b)*, and at the second term on the 
right-hand side, which is 4a?b. In this term, a has power 3 and b has 
power 1, and 3+ 1 = 4, which is the power of the brackets in (a + b)*. You 
can check that the powers add up to 4 in all the other terms in the formula 
for (a + b)4. 


You can see why this property holds for all the formulas if you look at, and 
think about, the working that produced the formulas for (a + b)?, (a + b)’ 
and (a+ b)*. Each new formula is produced by multiplying the formula 
before by a+ b. When this is done, each term of the old formula is 
multiplied by a to give terms of the new formula, and separately multiplied 
by b to give further terms of the new formula (before the like terms are 
collected). This raises the sum of the powers in each term by 1. So, for 
example, since the sum of the powers in each term in the formula for 
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(a +b)? is 2, it follows that the sum of the powers in each term in the 
formula for (a +b)? is 3, and so on. 


This property means that the terms on the right-hand side of each formula 
can be arranged in a standard order, as is done above. In the first term, a 
is raised to the same power as the brackets on the left-hand side. In each 
subsequent term, the power of a is decreased by 1 and the power of b is 
increased by 1 until, in the last term, b is raised to the same power as the 
brackets. 


The second thing to notice about the formulas is that each coefficient on 
the right-hand side is the sum of the two adjacent coefficients in the line 
above. For example, the coefficient of atb? in the formula for (a + b)® is 15, 
which is the sum of the two adjacent coefficients 5 and 10 in the formula 
for (a+ b)°, as shown below. 


(a +b)’ = a Bub +0036? + 10a?b? + 5abt + b 


waal 
(a+b) = a® + 6a5b +(15a*b? + 20a3b? + 15a7b4 + babë + b° 


To see why this always happens, again look at and think about the 
working for (a + b)? and (a + b)t, at the steps where the like terms are 
aligned. You can see that each coefficient in each new formula is obtained 
by adding two adjacent coefficients from the formula before. 


This means that you can find the coefficients for any of the formulas from 
the triangular array below, which is known as Pascal’s triangle. The 
array has 1s down each edge, and each of its other numbers is the sum of 
the two adjacent numbers in the line above. It can be continued 


indefinitely. 
(a +b)? 1 
(a+b)! 1 1 
(a+b)? 1 2 1 
(a +b)’ 1 3 3 1 
(a +b)* 1 4 6 4 1 
(a +b)’ 1 5 10 10 5 1 
(a+b) 1 6 15 20 15 6 1 
o 
° ee 
Notice that the array is symmetrical — the numbers in each row are the 2 e 2o cS 
same whether you read them from left to right or from right to left. 1 3 6 
Pascal’s triangle contains many interesting sequences. For example, look at e eve 
the diagonal lines of numbers. The first diagonal is 1,1,1,1,1,..., and the ose oe 6 o 
second diagonal contains the natural numbers 1, 2,3,4,5,.... The third iy A mip eee 
diagonal contains the numbers 1,3,6,10,15,..., which are known as the w 15 


triangular numbers, because they correspond to triangular patterns of 
dots, as shown in Figure 26. There are many other less obvious patterns in Figure 26 Triangular 
the triangle. numbers 
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Figure 27 Notation for 
binomial coefficients 
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Activity 35 Using Pascal's triangle to find a formula for (a + b)" 


(a) Calculate the row of Pascal’s triangle that gives the coefficients for the 
formula for (a + b)”. 


(b) Hence write down the formula for (a + b)". 


Activity 36 Expanding an expression of the form (a + b)” 


For each of the following expressions, use one of the formulas that you 
have seen in this subsection to expand the brackets. 


(a) (3x +2) (b) (z — 4y)“ 


The numbers in Pascal’s triangle are called binomial coefficients. They 
can be denoted as follows. If the rows of Pascal’s triangle are numbered as 
row 0, row 1, row 2 and so on, and the coefficients within each row are 
numbered from left to right as coefficient 0, coefficient 1, coefficient 2 and 
so on, then coefficient k in row n is denoted by ”Cķ, as illustrated in 
Figure 27. For example, 


30 = 1, 301 =3, 03 and 3C3=1. 
The letter C in this notation doesn’t stand for ‘coefficient’, as you might 
expect, but rather for ‘choose’ or ‘combination’ — the reason for this is 
mentioned shortly. The notation "C; is read as ‘n choose k’ or just as 
nCk’. 


A common alternative notation for the binomial coefficients is 


(i) 


which is again read as ‘n choose k’. (This notation has no connection with 
the column notation for two-dimensional vectors that you met in Unit 5, 
though it looks the same.) 


With the notation "Ck, the general formula for (a + b)” can be written as 
follows. 


The binomial theorem 
For any natural number n, 
(a aE D = Gi qa” a en ab EE COM ab fe oo ae 


where "Cp is coefficient k in row n of Pascal’s triangle (where the 
rows, and the coefficients within each row, are numbered 0, 1, 2, ...). 


The right-hand side of the equation in the binomial theorem is known as 
the binomial expansion of (a + b)”. 


A good way to remember the terms in the equation in the binomial 
theorem is to use the fact that the powers of a and b on the right-hand side 
of the equation follow the pattern described earlier. In the first term a is 
raised to the same power as the brackets on the left-hand side, and in each 
subsequent term the power of a is decreased by 1 and the power of b is 
increased by 1, until in the last term b is raised to the same power as the 
brackets. To complete the expansion, you just need to include the 
coefficients of the terms, which are the binomial coefficients 

Co, ”C1,”C2,...,” Cn. 


Intriguingly, the binomial coefficients also occur in a seemingly unrelated 
area of mathematics, concerned with finding answers to questions such as 
‘How many different sets of six lottery numbers can be chosen from 49 
possible numbers?’ This is the context which led to "Cy being read as 

‘n choose k’. You’ll learn about it if you go on to study the module 
MST125 Essential mathematics 2. 


Pascal’s triangle is named after the French mathematician and 
philosopher Blaise Pascal. He was far from the first person to study 
this array of numbers, but his work on it in his Traité du Triangle 
Arithmétique was influential. Research on binomial coefficients was 
also carried out at about the same time by John Wallis (1616-1703) 
and then by Isaac Newton (1642-1727), who discovered that the 
binomial theorem can be generalised to negative and fractional 
powers. You'll learn about this in Unit 11. 


Pascal contributed to many other areas of mathematics in his short 
life. He worked on conic sections and projective geometry, and, 
together with Pierre de Fermat, laid the foundations for the theory of 
probability. 


Pascal’s triangle was studied centuries earlier by the Chinese 
mathematician Yanghui and the Persian astronomer and poet Omar 
Khayyám, and is known as the Yanghui triangle in China (see 
Figure 28). 


5.2 A formula for binomial coefficients 


Pascal’s triangle is a convenient way to obtain the binomial coefficients for 
fairly small values of n. However, it would be tedious to obtain a 
particular coefficient in the expansion of (a + b)!*, say, by using Pascal’s 
triangle. It is therefore desirable to have a closed-form formula for "Cx, 
and you'll meet such a formula in this subsection. First you need to learn 
about factorials, which occur in the formula for ” Ck. 
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Figure 28 The Yanghui 
triangle (Pascal’s triangle), 
from a publication of Zhu 
Shijie, AD 1303 
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For any natural number n, the product of all the natural numbers up to 
and including n is called the factorial of n, and denoted by n! (this 
notation is read as ‘n factorial’ or ‘factorial n’). So 


n!=1x2x3x---xn. 
For example, 

5l=1x2x3x4x5 = 120. 
We also define 

0! = 1, 


because this interpretation of 0! works well with formulas that involve 
factorials, as you'll see shortly. 


The values of n! increase rapidly as n increases. The first few values are 
shown in Table 1. 


Table 1 Values of n! 
n 01 2 3 4 5 6 7 8 9 10 
n! 1 1 2 6 24 120 720 5040 40320 362880 3628800 


Many calculators can evaluate n!, often for values of n up to 69. (The 
value of 69! is about 1.7 x 109°, whereas 70! is about 1.2 x 10100, 
Typically, calculators don’t carry out calculations that involve numbers 
greater in magnitude than 101%.) 


Notice that 
i=1xoO!l, 2!=2x1!, 3!=3x2!, and so on. 

In general, n! can be expressed in terms of (n — 1)! as follows: 
m= 1) (m= 1,2, 3a): 


So the sequence of factorials, 1,1, 2,6,24,120,..., is generated by the 
recurrence system 


co = 1, Ga = neni t= 12 3e) 
You met this recurrence system, and its first few terms, in Example 3(c) 
on page 13. 


There’s a formula for the binomial coefficients that can be stated concisely 
in terms of factorials, as below. You’ll see a justification of this formula at 
the end of this section. 


If n and k are integers with 0 < k < n, then 


n! 


oS kl (n — k)! 


(13) 


For example, this formula gives 
5! 120 
5 
C2 = — = = 10 
aoa axe 


which accords with the value of °C in Pascal’s triangle. 


When you use the formula above to evaluate binomial coefficients by hand, 
it’s generally best not to evaluate the three factorials in the formula and 
then do the division. There’s usually an easier and quicker way to proceed. 
To see this, consider the following calculation of ‘C3 using the formula, 
which is written out in full: 

70 o T! 7x6x5x4x3x2x1 

3 34l (8x2x1)x(4x3x2x]1) 

The two occurrences of 4 x 3 x 2 x 1 on the top and bottom cancel out, so 


7x6x5 
To — 
Ce IKI 


We can now do some more cancelling, and hence obtain the value “C3 = 35. 


In general, whenever you use formula (13) to work out a value of "Cj, 
where k is not equal to either 0 or n, the factorial (n — k)! in the 
denominator cancels with the ‘tail’ of the factorial n! in the numerator. 
The numerator then contains the product of all the integers from n down 
to the integer that’s one larger than n — k, and the denominator contains 
just k!. So, when k is not equal to 0 or n, formula (13) can be restated in 
the following form, which is not as neat but usually easier to apply. In fact 
this form of the formula also applies when k = n, but in this case it’s just 
as easy to use the original form. 


If n and k are integers with 0 < k <n, then 
n(n —1)---(n-—k+1) 


Ck = Kea iyo 


This formula has k factors on the bottom and also k factors on the top. 
So, to apply it, you start with the integer n on the top and the integer k 
on the bottom, and for each of these integers you keep multiplying by the 
next integer down until you have k factors in total. 


This is illustrated in the next example, but first here’s one more fact to 
keep in mind when you’re evaluating binomial coefficients. Remember that 
the binomial coefficients in each row of Pascal’s triangle are the same no 
matter whether you read them from left to right or right to left. In other 
words, we have the following general fact. 


"On = Co 


So, for example, if you want to evaluate !°C7 using the method described 
above, then it’s better to evaluate !°Cj9_7, that is, '°C3, instead. You'll 
get the same answer, but the working will be easier, because you'll have 
only 3 factors on each of the top and bottom of the fraction, instead of 7. 
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Example 20 Evaluating binomial coefficients using the formula 
Evaluate the following binomial coefficients without using a calculator. 
la) Ca (by Cis (c) 4Cy 

Solution 


(a) @. Write down a fraction with 9 on the top and 4 on the bottom. 
For each of these integers, keep multiplying by the next integer 
down until you have 4 factors. & 

OLELE 

9 
Cy = ——— 
Perera 


@. Do all the cancelling that you can do, and hence evaluate the 
fraction. & 


= 0 2x T= 125 
(b) &. Use the fact that “C15 = !"Cy7_15, then proceed as in 
part (a). ® 
17 x 16 


KOTE "Cp = =17x8=136 


ZX AL 
(c) @. Since n = k here, use the original form of formula (13), rather 
than the alternative form. Remember that 0! = 1. ® 
4! 4! 
= — = = | 
MOT Al x1 


“Oh 


Activity 37 Evaluating binomial coefficients using the formula 


Evaluate each of the following without using a calculator. 
(a) C4 (b) Cio (ce) 50s (d) 5Co (0) Ca 


Another way to evaluate binomial coefficients is simply to use your 
calculator. (You should find that "C; is a function for one of the buttons.) 
Activity 38 Evaluating binomial coefficients using a calculator 


Evaluate each of the following by using a calculator. 
(a) Cis (b) Cir 


The binomial coefficients were known to early Islamic 
mathematicians. For example, al-Karaji, who flourished early in the 
11th century, provided a table of binomial coefficients in a text (now 
lost but known through a copy in The Brilliant in Algebra by 
al-Samaw’al (1125-1174)), while the formula for them was given by 
al-Kashi (d. 1429) in his book The Calculator’s Key. A proof that 
this formula is correct was given by Pascal. 


5.3 Working with the binomial theorem 
In Subsection 5.1 you saw a statement of the binomial theorem, as follows: 
For any natural number n, 

(a J b)” = aC a” j- oC, ap erp nO i abt Ç HGA b”, 
where "Cx is coefficient k in row n of Pascal’s triangle (where the rows, 
and the coefficients within each row, are numbered 0,1, 2,...). 


In Subsection 5.2 it was stated that the binomial coefficients are given by 

n! 
eel 
Hence the binomial theorem can now be restated, without direct reference 
to Pascal’s triangle, as follows. 


"Ch (13) 


The binomial theorem (restated) 
For any natural number n, 

lati" =" Ca" + (Cra see Crmay = = Cb 
where 


"Ch, = (k =0,1,2,...,n). 


The binomial theorem can also be written in sigma notation, as follows. 


The binomial theorem (sigma notation) 
For any natural number n, 


n Min n—kyk _ “ n! n—kyk 
(a +b) 2 Cpa i a bo 


It’s useful to be familiar with the first few binomial coefficients in the 
expansion of (a+ b)”. The formula for "C;, gives 


n(n — 1) nC, = n(n — 1)(n — 2) 


"‘Co=l “=m "O= a = 31 
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® Example 21 Using the binomial theorem 


—- Use the binomial theorem to find the first four terms in the expansion 


of each of the following expressions. 
@ Gag (by 2—2)" 
Solution 
(a) Œ. In the binomial theorem, put n = 8, a = z and b = y. & 
The first four terms in the binomial expansion of (x + y)? are 
2® + 8Cya'y + 8Con®y? + Ca 
ae, 2 ne y 


m a 


=x? + 8x'y + 28xfy? + 56xy?. 


= e y 


(b) & In the binomial theorem, put n = 7, a = 2 and b = —z. # 


The first four terms in the binomial expansion of (2 — x)’ are 


DG) <2 (27) Cy 2 (a) (Cn 2 (a) 


7x6 7x6x5 
= 2 -Tx 8x + = x 282? — =A x 248 
= 128 — 4482 + 67227 — 5602°. 


Activity 39 Using the binomial theorem 


Use the binomial theorem to find the first four terms in the expansion of 


each of the following expressions. 


(a) (+2) (b) (2442) 


Sometimes it’s useful to find a particular term in a binomial expansion. 


The next example demonstrates how to do this. 


Cc) Example 22 Finding a particular term in a binomial expansion 


Find the coefficient of x!y° in the expansion of 


(22— $y)”. 
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Solution 


By the binomial theorem, each term in the expansion is of the form 


2101, (Qar)21-k unr = 20), x Q2l- ky 21k e yt 


@. Write the powers of xz and y at the end, and simplify the powers 
of 2. ®@ 

= O D E 

= 210, (—1)21 -2k z?1-kyk, 


The term in zt?y’ is obtained when k = 9. Hence the coefficient of 
1D, © 


py aS 
21. (—1)9Q?1-2x9 = —21Cy x 2 
®. Use a calculator to evaluate ?!Co. @ 
= —293 930 x 8 
= —2351 440. 


Activity 40 Finding particular terms in binomial expansions 


(a) Find the coefficient of afb” in the expansion of (a + b)!!. 


(b) Find the coefficient of c°d!° in the expansion of (3c — d)”. 


Here’s a slightly more complicated example. 


Example 23 Finding a particular term in another binomial CS) 
expansion = 


Find the coefficient of p* in the expansion of 


1 iy 
2 e 
(> as T l 


Give your answer as a fraction in its lowest terms. 
Solution 


By the binomial theorem, each term in the expansion is of the form 


iri 1 k ' ) 1 k 1 k 
17 A E a 27k E = 
Ce (P") = ae () G) 
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Activity 41 Finding particular terms in more binomial expansions 


(a) Find the constant term in the expansion of 


e" 


(b) Consider the expansion of 


15 
pe a 
2h? 
(i) Find the coefficient of h3. 


(ii) Find the coefficient of h~”. 


(iii) Show that there is no term in h?. 


An important special case of the binomial theorem, which occurs 
frequently, is obtained by taking a = 1 and b= x. This gives the following 
expansion. 


For each natural number n, 
Te La Cr 2 On CO oae ao 
(n ) 2 a n(n )( 


nm—2 
a 7 E 


So 
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Activity 42 Applying a special case of the binomial theorem 


Use the form of the binomial theorem in the box above to evaluate the first 
six terms of the expansion of (1 — 2)!". 


Various interesting results can be obtained by choosing particular values 
for a and b in the statement of the binomial theorem. For example, if 
a=b=1, we have that 

14+"C, +” Co + "C3 +- +”Chn-1 aE 


In sigma notation, this is 


n 


XO "Oh = 2": 


k=0 
Hence, for each natural number n, the finite series formed from the 
sequence of binomial coefficients "Cz, k = 0,1,2,...,n, has sum 2”. In 
other words, the nth row of Pascal’s triangle has sum 2”. 


A proof of the formula for the binomial coefficients 


We can prove that the formula 
n! 
~ k(n —k)! 


is correct by proving the following two facts. 


"Oh (14) 


1. Formula (14) gives the correct result for each of the ‘edge values’ of 
Pascal’s triangle — that is, for n = 0,1,2,..., 


"Co = 1 and "Cn = 1, 


2. Formula (14) has the key property of Pascal’s triangle described on 
page 73 — that is, each binomial coefficient (apart from the edge 
values) in each row is the sum of the two adjacent coefficients in the 
row above. 


Fact 1 holds because, for n = 0,1,2,..., formula (14) gives 


n! ra n! 
=o oS Se a 


n 


Co 


Proving fact 2 takes more work. Consider any binomial coefficient ” Ck 
that isn’t an edge value (so k 4 0 and k Æ n) in row n of Pascal’s triangle. 
The two adjacent coefficients in the row above (row n — 1) are 


ao and n, 
Fact 2 can therefore be expressed algebraically as the equation 
Ok = m + n=l, (15) 


for n = 2,3,4,... and k = 1,2,3,...,n — 1. We now verify that this 
equation holds. 
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The formula 


a n! 
Oe Fi (n— k)! 
gives 
siga ea (n~1)! 
k((n—1)—k)! k!(n—k-1)! 
and 
ae (n—1)! (n—1)! 


k=- kk 

Hence the expression on the right-hand side of equation (15) is 
(n—1)! (n—1)! 

(k-1)!(n—k)! ki(n-k-1)! 


Multiplying the top and bottom of the first fraction by k, and the top and 
bottom of the second fraction by n — k, gives 


k(n=1)! (n—k)(n—-1)! 


ae ae ale ee = 


oa As a me oe = 


k! (n — k)! k!(n — k)! 
o (k+n—-k)(n-1)! 
k! (n — k)! 
n(n — 1)! n! =", 


~ k(n- k)!  kl(n— k)! 
which is the left-hand side of equation (15). This completes the proof. 


Learning outcomes 


After studying this unit, you should be able to: 
e understand and use standard terminology and notation for sequences 


e find the parameters of an arithmetic or geometric sequence from the 
first few terms, and hence write down a recurrence system for the 
sequence 


e find a closed form for an arithmetic or geometric sequence, given the 
first few terms of the sequence or a recurrence system for it 


e draw the graph of the first few terms of a sequence 

e determine the long-term behaviour of certain simple sequences 

e find the sum of a finite arithmetic or geometric series 

e where possible, find the sum of an infinite geometric series 

e understand the link between Pascal’s triangle and binomial expansions 


e find terms in the binomial expansion of an expression of the form 
(a +b)”, by applying the binomial theorem and evaluating binomial 
coefficients. 


Solutions to activities 


Solution to Activity 1 


(a) The first term of the sequence 


(bn); with terms 1, 4, 7, 10, 13, 16, 19, ... 


is the term with subscript 1; that is, bı = 1. 
The fourth term is b4 = 10. 


(b) Counting along the terms of the sequence, 


16 is the 6th term. Hence if bp, = 16, then n = 6. 


(c) No, bo is not defined, because 0 is not included 
in the range of values n = 1,2,3,.... 


Solution to Activity 2 


(a) ai= To az= 14, a3 — 21, ay = 28. dg = 35, 
a100 = 700. 

(b) by 1, b2 5, bs i, b4 $, bs, E, 
b100 = T55: 


(You may have converted these simple fractions 
to decimals, but there is no need to do this in 
such cases.) 


c3 = 1, c4 = —1, c5 = 1, C100 = —1 
(@) a= (-1)! x1 =-1, 
dy = (—1)? x 2 = , 


ds = —3, d4 = 4, ds = —5, dioo = 100. 
(e) e1 = (—2)' = -2, 

eg = (—2)? = A, 

e3 = —8, e4 = 16, e5 = —32, 

e100 = (—2)10° = 1.27 x 10% (to 3 s.f.). 


Solution to Activity 3 
(a) We have a; = 1, a2 
A closed form is 

(n = 1,2,3,...), 
and the 10th term is ajo = 10. 


2, a3 3, a4 4. 


a, = T 


(b) We have a; = 2 = 2!, ag = 27, ag = 2’, ay = 24. 


A closed form is 
an =2" (n=1,2,3,...). 
The 10th term is aio = 21° = 1024. 


(c) We have a1 = —1, a2 = 1, a3 = —1, a4 = 1. 
A closed form is 
an = (1) (n=1,2,3,---), 


and the 10th term is aig = (—1)19 = 1. 


Solutions to activities 


(d) We have a; = 13, az = 23, ag = 33, ag = 4°. 
A closed form is 
i=" (t= 1,932.) 
and the 10th term is ajo = 10° = 1000. 
(e) We have aj = 6 x 1, ag = 6 x (—1), ag = 6 x 1, 
a4 = 6 x (—1). A closed form is 
i= G6 (N 15 9,3)053). 
Since (—1)"*1 = (—1)"(—1)! = —(-1)", this 
closed form can also be written as 
an = —6 x (—1)” (n=1,2,3,...). 
The 10th term is aio = 6 x (—1)! = —6. 
(f) We have a; = 4, 


A closed form is 
n 


2 3 4 
a2 39 a3 J: G4 5: 


ün = (m=1; 23s). 


= n+1 
Since n = (n + 1) — 1, this closed form can also 
be written as 


(n = 1,2,3,...). 


an = 


n+l 
The 10th term is aio = E, 
(g) We have a; = 2 x 1, ag = —2 x 2, ag = 2 x 3, 
a4 = —2 x 4. A closed form is 
an = I2n(—<1)""* (n=1,2,3,...). 
Since (—1)"*1 = (—1)"(—1)! = —(—1)”, this 
closed form can also be written as 
an = —2n(—1)” (n =1,2,3,...). 
The 10th term is aio = 2 x 10 x (—1)!! = —20. 


Solution to Activity 4 
(a) (i) ao 30 1, ay 3° 


3, a2 = 3 =9. 


(ii) be $, b3 z, b4 b 
(ili) cr = 4, & = $, 63 = h- 
(b) (i) We have do = 3°, dı ZR dy =), d3 = 3°. 
A closed form is 
dn =3" (n=0,1,2,...), 
and the sixth term is d5 = 3° = 243. 
(ii) We have es F, e6 z, e7 z, eg Z. 
A closed form is 
(n=5,6,7,...), 


Cy = = 
n 


and the sixth term is e10 = 75. 
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(iii) We have fo = ee fs = aoe 
fa = Sy Js= ely A closed form is 


fn = (-3)”  (n =2,3,4,...), 
and the sixth term is f7 = ey = -z 
Solution to Activity 5 
(a) (i) bp =2(n+1) (n=0,1,2,...) 
(ii) b, =3" (n=0,1,2,...) 
(iii) b, =6+n (n=0,1,2,...) 
Gv) bn = An (m= 0,1,2..2.) 
bG) GOA" (n123) 
(ii) bp =5(n—1) (n=1,2,3,...) 
(ii) ea (n =1,2,3,...) 
(iv) bn =2+3(n-—1) (—1,2,3,...), 
which simplifies to 
bn =—1+3n (n=1,2,3,...). 
3n—2 
(c) (i) bp = (w=2/3,4).,.) 
m- 
a t= ae 
(n —1)(n +1) 


Solution to Activity 6 


3, a4 7, a5 15. 
—l, b4 = 0, bs =—-1. 


(a) ay 0, a2 L a3 

(b) by 1, by 0, bs 

(c) Co = 2, cy = 1.5, 
c2 = 17/12 = 1.416667 (to 6 d.p.), 


c3 = 577/408 = 1.414216 (to 6 d.p.), 
c4 = 1.414214 (to 6 d.p.). 


Solution to Activity 7 


(a) The sequence (£n) is arithmetic, with 
parameters a = —1 and d= 1. 


(b) The sequence (yn) is not arithmetic. The term 
Yn—1 on the right of the recurrence relation is 
multiplied by —1. 

(c) The sequence (zn) is arithmetic, with 
parameters a = 1 and d = —1.5. 
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Solution to Activity 8 


(a) The first term is a = 1, and the common 
difference is d = 4— 1 = 3. So the recurrence 
system is 

r= 1, =e ES HS 2A ows) 
The next two terms are 

zs = z4 +3 = 10 +3 = 13, 

ze = z5 +3 = 13+3 = 16. 

(b) The first term is a = 2.1, and the common 
difference is d = 3.2 — 2.1 = 1.1. So the 
recurrence system is 

(n= 23,8 esa): 


Y= 21 Yn = Yn-1+1.1 


The next two terms are 
ys = ya + 1.1 = 5.4 + 1.1 = 6.5, 
Ye = Ys + 1.1 = 6.5 + 1.1 = 7.6. 
(c) The first term is a = 1, and the common 


difference is d = 0.9 — 1 = —0.1. So the 
recurrence system is 


z= 1, Zn = Zn- = 0.1 (n=2,3,4,.2.,11): 
The next two terms are 
z5 = z4 — 0.1 = 0.7 — 0.1 = 0.6, 


z6 = z5 — 0.1 = 0.6 — 0.1 = 0.5. 


Solution to Activity 9 


(a) The first term is 1000, the last term is 10 and 
the common difference is d = 970 — 1000 = —30. 
Hence the number of terms is 

10 — 1000 
—30 
so there are 34 terms in the sequence. 


+1 = 34, 


(b) The corresponding recurrence system is 


zı = 1000, Zn =2n-1—-30 (n=2,3,4,...,34). 


Solution to Activity 10 


(a) Since a = 1 and d = 3, the closed form is 
In =14+3(n—-1) 
=3n—-2 (n=1,2,3,...). 
This gives 74 = 3 x 4 — 2 = 10, as expected, and 
also 
19 = 3 x 10 — 2 = 28. 


(b) Since a = 2.1 and d = 1.1, the closed form is 
Yn = 2.1 + 1.1(n — 1) 


=1lln+1 (n=1,2,3,...). 
This gives y4 = 1.1 x 4+ 1 = 5.4, as expected, 
and also 


Yio = 1.1.x 104+1=12. 


(c) Since a = 1 and d = —0.1, the closed form is 
zn = 1—0.1(n— 1) 
=11-0.1n (n=1,2,3,...,11). 
This gives z4 = 1.1 — 0.1 x 4 = 0.7, as expected, 
and also 
zıo = 1.1 — 0.1 x 10 = 0.1. 


Solution to Activity 11 


(a) The sequence (£n) is geometric, with 
parameters a = —1 and r = 3. 


(b) The sequence (yn) is geometric, with 
parameters a = 1 and r = —0.9. 


(c) The sequence (zn) is not geometric, because the 
expression on the right of the recurrence 
relation contains the term +1. 


Solution to Activity 12 


(a) The first term is a = 1, and the common ratio is 
r= $/1 = 5. So the recurrence system is 


t= l, Tn = $£n—1 (m= 2, BAe): 


The next two terms are 


eee. ee 
T5 = 944 = 3 X § = Te? 
EE P hi N 
T6 = 375 = 3 X TG = 32 


(b) The first term is a = 4.2, and the common ratio 
is r = 7.14/4.2 = 1.7. So the recurrence system 
is 


yy = 4.2, Yn = 1.7Yn-1 
The next two terms are 
Ys = 1.7y4 = 1.7 x 20.6346 
= 35.078 82 
= 35.079 (to 3 d.p.), 
Ys = 1.7y5 = 1.7 x 35.078 82 
= 59.633 994 
= 59.634 (to 3 d.p.). 


(m= 23A eia): 


Solutions to activities 


(c) The first term is a = 2, and the common ratio is 
r = (—2)/2 = —1. So the recurrence system is 
(t= 2,342): 


Z1 = 2; Zn = —2n-1 


The next two terms are 
Z5 — —Z4 = —(—2) = 2, 


£6 = 25S —2. 


Solution to Activity 13 


(a) Suppose that the sequence has N terms, with 
first term 21 = 7. Then the last term is 
zn = 2734375. The common ratio is 
PE 2) =. 
Now N is given by 


yN-1 — a 
Z1 
that is, 
2734 
gei AEST = 390625. 


Taking the natural logarithm of both sides of 
this equation gives 

(N — 1)1n5 = 1n(390 625), 
from which 


Vea. ee 
4 ln5 

Hence there are nine terms in the sequence. 
(b) The corresponding recurrence system is 


2 =T Zn = Sent (m=2 34a) 


Solution to Activity 14 
(a) Since a = 1 and r = 4, the closed form is 
by = 1x (4) 


= (p 


This gives x4 = ( 
and also 
—(1\9 _ 1 
zo = (3) =a 
= 1.953 x 107 (to 4 s.f.). 
(b) Since a = 4.2 and r = 1.7, the closed form is 
Yn SAI" (p18 Fs 
This gives y4 = 4.2 x 1.7% = 20.6346, as 
expected, and also 
Yio = 4.2 x 1.79 = 498.1 (to 4 s.f.). 


(n = 1,2,3,...). 
n = Ł, as expected, 
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(c) Since a = 2 and r = —1, the closed form is 
SO: Sa 
This gives z4 = 2(—1)? = —2, as expected, and 
also 


zo = 2(—1)° = —2. 


Solution to Activity 15 


The first point to be plotted is (1, 21) = (1,1). The 
second point is (2, z2) = (2,0.9). The subsequent 
points are (3,0.8), (4,0.7), (5,0.6) and (6,0.5). The 
graph is as follows. 


Zn 


Solution to Activity 16 
(a) The first point to be plotted is (1, y,) = (1, 4.2). 
The second point is 
(2, y2) = (2,4.2 x 1.7) = (2,7.1). 
The subsequent points (to 1 d.p.) are (3, 12.1), 
(4, 20.6), (5, 35.1) and (6,59.6). The graph is as 


follows. 
Ynk 
60- e 
40- 
e 
204 ° 
e 
e 
0 > T T T T T nh 


(b) The first point to be plotted is 
(1,1) = (1,(—1)”) = (1,1). 
The second point is 
(2, c2) = (2, (—1)*) = (2, 1). 
The subsequent points are (3, 1), (4,—1) 
and (5,1). The graph is as follows. 
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Cyr 


=e 


The dashed lines shown above are not part of 
the graph of the sequence, but draw attention 
to the fact that the terms of this sequence 
alternate in sign. 


Solution to Activity 18 
The graph in Figure 9(b) shows that, for this 


sequence, 
(or lim rz, = L) : 
n-co 


The graph in Figure 9(c) shows that, for this 
sequence, 


In > Lasn->o 


Ln > —œ as n — OO. 


The graph in Figure 9(e) shows that, for this 
sequence, 


In > 0 as n => œ (or lim @n =0). 


n— oo 


Solution to Activity 19 


The sequence (£n) with closed form £n = 3 — Ẹn is 
arithmetic, with common difference d = -4, Since 
—3 < 0, the sequence (£n) is decreasing and 


Ln > —œ as n — OO. 


Solution to Activity 20 


(a) Since r = 1.2 > 1, the sequence (1.2”) is 
increasing and 1.2” + co as n > oo. To obtain 
(zn) we multiply each term by the negative 
constant —4. Hence (£n) is decreasing and 
In + — as n > oo. 


Since —1 < —0.9 < 0, the sequence ((—0.9)”) 
alternates in sign and (—0.9)” > 0 as n > ov. 
To obtain (yn) we multiply each term by the 
non-zero constant 5. Hence (yn) alternates in 
sign and yn > 0 as n > oo. 


(c) Since —2.5 < —1, the sequence ((—2.5)”) 
alternates in sign and is unbounded. To obtain 
(zn) we multiply each term by the negative 
constant —7;. Hence (zn) also alternates in sign 
and is unbounded. 

(Details of how to use the CAS to plot graphs of 
these sequences are given in the Computer 
algebra guide, in the section ‘Computer methods 
for CAS activities in Books A—D’.) 


Solution to Activity 21 


(a) From Activity 20(a), the sequence with terms 
Yn = —4 x 1.2” is decreasing and yn —> —0o 
as n — oo. Since an = Yn + 17 for each n, it 
follows that the sequence (an) is also decreasing 
and an — —œ as n > œ. 


From Activity 20(b), the sequence with terms 
zn = 5(—0.9)” has limit 0 and its terms are of 
alternating sign. Since bn = Zn + 45 for each n, 
it follows that ban —> 45 as n — oo; that is, 


lim bp = 45. 


n—> oo 
Also, the terms of (bn) alternate either side 
of 45. 


(Details of how to use the CAS to plot graphs of 
these sequences are given in the Computer 
algebra guide, in the section ‘Computer methods 
for CAS activities in Books A—D’.) 


Solution to Activity 22 


The sequence is 6,13, 20,...,90. The first term is 6, 
the last term is 90 and the common difference is 
d = 13 — 6 = 7, so the number of terms is 
90 —6 
1=18. 
7 + 


Hence the sum is 
$ x 13 x (6 + 90) = 13 x 48 
= 624. 


Solution to Activity 23 
Expression (5) is used in each case. 
(a) Here a = 1, d = 1 and n = 100. The sum is 
5 x 100 x (2x 1 +99 x 1) = 50 x 101 
= 5050. 
(b) Here a = 12, d = 3 and 


— 12 
nee +1=17. 


Solutions to activities 


The sum is 
$x 17x (2x 12+16x3)=4x 17 x 72 
= 612. 
(c) Here a = 1, d = 2 and 
ps ed ie 
2 
The sum is 
$x 10x (2x1+9x2)=5 x 20 
= 100. 


Each of these answers could also have been 
obtained using expression (4). 


(Notice that the result of part (c) is another 
example of a pattern observed near the start of this 
subsection, namely that adding up consecutive odd 
numbers, starting at 1, always seems to give a 
square number. Here’s how you can prove that this 
pattern holds in general. 


The sum of the first n odd numbers forms an 
arithmetic series with first term a = 1, common 
difference d = 2 and n terms. By the formula in the 
box above this activity, this series has sum 

in(2 x 1+ (n—1) x 2) = $n(2 + 2n — 2) 


=l 
= 5n X 2n 


=n?) 
Solution to Activity 24 
(a) The numbers added form a geometric sequence 
whose 10 terms can be expressed as 
3" (n =1,2,...,10). 
The first term is a = 3 and the common ratio 
is r = 3. Using expression (7) with n = 10, the 
sum of these terms is 
3 (1 — 37°) 
1-3 


= ł(59 049 — 1) 
= 88 572. 


The numbers added form a geometric sequence 
with first term 1 and common ratio -—4, and 
with 9 terms. (Be careful when working out the 
number of terms of a geometric sequence that 


has first term 1.) From expression (7) the sum is 


= 0.750038 (to 6 d.p.). 
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Solution to Activity 25 
(a) (i) The formula for the first n natural 
numbers is 
14+243+4+---4+n=$n(n+1). 
Here n = 30, and so 
14+2+4+3+---+30= į x 30x 31 
= 465. 
(ii) The formula for the squares of the first n 
natural numbers is 
12422432 4...4 p72 
= {n(n + 1)(2n + 1). 
Here n = 10, and so 
1742743? 4---+10° 
=%x10x 11x21 
= 385. 


(iii) We use the same formula as in part (a)(ii). 


Here n = 30, and so 
1? +2? +3? +- +30? 
= ¿į x 30 x 31 x 61 
= 9455. 


(iv) The formula for the cubes of the first n 
natural numbers is 


1? +2 43% o Hn? = in? (n1). 
Here n = 30, and so 
1? + 2° +3 +---+30° = Ł x 30? x 31? 
= 216225. 
(b) (i) Note that 
11? + 127 +13? +--- +30? 
= (17 +2? 43?+--- +307) 
— (17+2?4+37+4---+107). 
It follows from the results of parts (a) (ii) 
and (iii) that 
11? +12? + 13? +. --- + 30? 
= 9455 — 385 
= 9070. 
(ii) Note that 
20° + 21° + 22° + --- + 40° 
= (18 +23 +33 +---+ 403) 
— (18 +29 +38 +--+ 19°). 
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Applying the formula from part (a) (iv), 
we have (with n = 19) 
1% 4233 p0 
= $x 19? x 20? 
= 36 100 
and (with n = 40) 
13 + 23 +33 +. ---+ 40° 
= 7 x 40? x 41? 
= 672 400. 
It follows that 
20° + 21° + 223 +... + 40° 
= 672 400 — 36 100 
= 636 300. 


Solution to Activity 26 


(a) 


This is an infinite geometric series with first 
term a= 3 and common ratio r = 3. Since 
—1 < r < 1, the series has a sum, namely 


=. 


ALE] Alco 


This is an infinite geometric series with first 
term a = 1 and common ratio r = —4. Since 
—1 < r < 1, the series has a sum, namely 
Qo 1 ol , 
raas rer e y aE * 
This is an infinite geometric series with first 
term a = 1 and common ratio r = —2. Since 


r < —1, the series doesn’t have a sum. 


This is an infinite geometric series, with first 
term a= + and common ratio r = —3. Since 
—1 < r < 1, the series has a sum, namely 


Sle 


Solution to Activity 27 


(a) 


Put s = 0.454545.... The repeating group, 
‘45’, is 2 digits long, so multiply s by 107, to 
obtain 


100s = 45.454545...= 45 + s. 


45 _ 5 
Hence we have s = 3 = TT 


(b) Put s = 0.729729 729.... The repeating 
group, ‘729’, is 3 digits long, so multiply s 
by 10°, to obtain 

1000s = 729.729 729 729... = 729+ s. 


Hence we have s = = = H, It follows that 


= 27 _ 138 
3.729 729 729... = 3 + 20 = 138, 


Solution to Activity 28 


(a) The sums are as follows. 

20 
X nt = 544644744 
n=5 
19 

(i) XO (n1) = 544644744 
n=4 
6 

(iii) SO(n -1) =14+34+5+4---4+11 


n=1 


-- + 204 


-+ + 204 


The sum is 
150 


-+150= Son. 
n=1 


14+24+34+-- 


(ii) The sum is 


13 
5+6 +7 +---4+137 = Son? 
n=5 
(iii) The sum is 
Fo OP . +l? — ye 


Solution to Activity 29 


24 
(a) So k= 4 x 24x (2441) = 300 
k= 1 
24 
(b) XOK? = hx 24x (2441) x (2x 2441) 
k=l 
= % x 24 x 25 x 49 
4900 


Solution to Activity 30 


The solutions below apply the rules for 
manipulating finite series using sigma notation, in 
the box on page 66, and the formulas for the sums 
of standard finite series in the box on page 65. 


Solutions to activities 


=o 2 
=P - Sk 


= 2x $(30) 231)? — $(30)(31) 
= 432 450 — 465 
= 431985 


40 40 
O ae y= 
k=1 k=1 
40 40 
-5P -Di 
k=l k=l 


+ x #(40)(41)(81) — 40 


(a) Sok — 


= 5535 — 40 
= 5495 
(c) We have 
125 125 64 
XO (6k +7) = 5 (6k +7) — 5° (6k +7). 
k=65 k=1 k=1 
Now 
125 125 125 
X (6k +7) = ae a 
k=l 
125 125 
= 2? 
k=1 
= 6 x 4(125)(126) +7 x 125 
= 47 250 + 875 
= 48 125. 
Similarly, 
64 
X (6k +7) = S+ 
k=1 
= SDA 
k=1 k=1 
= 6 x $(64)(65) +7 x 64 
= 12480 + 448 
= 12928. 
Hence 
125 
X (6k +7) = 48 125 — 12928 = 35 197. 
k=65 
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Solution to Activity 31 


(a) 24 (2)? +(4)°+--= 50 (2)" 


Solution to Activity 33 
(a) (e+5)? =e + 2ex% 5+ 5? 
=c?+10c + 25 
(b) (1—3a)? = 17 +2 x 1 x (—3a) + (—32)? 
= 1-62 + 92? 
(c) (P — 97)? = (p*)? + 2p?(—¢?) + (-¢’)? 
= pf — 2p + 4t 


Solution to Activity 34 
(a) By formula (12), 
(5 +p)’ =53 +3 x5 xp+3x5 xp +p 
= 125 + 75p + 15p? + p’. 
(b) By formula (12), 
(1 — 2r)? = 18 +3 x 1? x (—2z) 
+3 x 1 x (—2z)? + (2x)? 
= 1 — 6x + 12x? — 82°. 
(c) By formula (12), 
(2x + 3y)? = (2x)? +3 x (2x)? x 3y 
+3 x 2z x (3y)? + (3y)? 
= 8z? + 36x7y + 54ry? + 27y’. 
Solution to Activity 35 
(a) The row is 1, 7, 21, 35, 35, 21, 7, 1. 
(b) This gives the formula 
(a +b)” =a" + 7afb + 21a°b? + 354b’ 
+ 35a3b* + 21a7b° + Tab? + b7. 
Solution to Activity 36 
(a) We use the formula 
(a+b)? 
= a + 5a*b+ 10a3b? + 10a7b? + 5ab* + b°. 
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Substituting a = 3x and b = 2 gives 
(3z + 2)° 
= (3x)? + 5(3x)* x 2 + 10(3x)3 x 2? 
+ 10(32)* x 2? + 5(3z) x 24 + 2° 


= 243r° + 810x4 + 1080x° + 720x? + 240z + 32. 


(b) We use the formula 
(a +b)! = af + 4a?b + 6a?b? + 4ab? +6". 
Substituting a = x and b = —4y gives 
(x — 4y)* 
= zf + 4r? (—4y) + 62?(—4y)? 
+ 4x(—4y)* + (—4y)* 
= xf — 16x y + 96x7y? — 256xy? + 256y*. 


Solution to Activity 37 


8x7x6x5 
(4 = —___—_= 70) 
(a) "C4 4x3x2x1l 
12x 11 
12 — 120, — — 66 
(b) **C x Co axl 6 
5! 
(c) "Cae 
5 5! 
(d) Co = oni =! 


21 
(e) at C0 = ma = T = 21 


Solution to Activity 38 
(a) 2°Cig = 177100 
(b) 32C17 = 565 722 720 


Solution to Activity 39 


(a) We put n = 10, a = 1 and b = z in the binomial 
theorem. The first four terms in the binomial 
expansion of (1+ 2)! are 

110 4.100, x 1z + Cy x 182? + C3 x 1723 

10x9 > 10x9x8 3 

a a 7 
=1+10r + 452? + 1202. 

(b) Here n = 10 (as in part (a)), a = 2 and b = $a. 
The binomial coefficients needed were obtained 
in part (a). The first four terms in the binomial 
expansion of (2 + 32)'° are 


219 4.10 x 29(4x) +45 x P (ir) + 120 x 27(2a)° 


= 1024 + 2207 + 128027 + H r. 


Solutions to activities 


Solution to Activity 40 (i) For the term in h, we need 
(a) By the binomial theorem, each term in the 15 — 3k = 3, 
expansion is of the form which gives 12 = 3k; that is, k = 4. Hence 
110, qll-kpk, the coefficient of h? is 
The term in afb? is obtained when k = 5. Hence me ey = 1365 x = = me one 
the coefficient of a®b° is ne us 
i 11! 11x10x9x8x7 ies (ii) For the term in h~!”, we need 
oS!  5x4x3x2x1 l 15 — 3k = —12, 
(b) By the binomial theorem, each term in the which gives 27 = 3k; that is, k = 9. Hence 
expansion is of the form the coefficient of h~!? is 
200, (30): (—d)* _ 200), (—1)*320-* 20-k ge 150 (-3)° — 5005 x > 
The term in c°d' is obtained when k = 15. 98513415 
Hence the coefficient of d! is = = ae 
2005(—1)1535 = —2°Cy, x 35 7 — 
(iii) For a term in h*, we need 
= —15 504 x 243 
15 — 3k =2, 
= —3767 472. ; ; ; f 
which gives 13 = 3k. Since there is no 
Solution to Activity 41 integer value of k that satisfies this 


equation, there is no term in h?. 
(a) The constant term arises when the power of x a : 


and the power of —1/a in the expansion are the Solution to Activity 42 


same, namely, 6. By the binomial theorem, this Weta = 17 dad aepince a be =o tw ie son 


term is 1\6 in the box. The first six terms of the expansion are 
2o x zê (-3) = coo 1 4: 170, (—2) + 17C3(—2)? J Ona? 
_ 12x 11x 10x9x8x7 + C4(—a)* + C5 (-a)? 
17 x 16 17 x 16 x 15 
6x5x4x3x2x1 1-179 + x z2 17X 16x15 3 
= 924. 2! 3! 
(b) By the binomial theorem, each term in the + eee 


expansion is of the form : 
17x 16 x 15 x 14x13 5 
PE 


3 \F 
15, x Ald-k (- ) 5! 


2h2 
: = 1 — 17x + 136x? — 680x? + 238024 — 618875. 
— O x pid-k (—3) Gis 


= bo, ey pid-k pj 2k 


= 50O, Gay" pid—3k. 
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Unit 11 


Taylor polynomials 


Introduction 


You may have wondered how a calculator or mathematical software 
package finds an approximate numerical value for In 3, e!/? or sin(0.2), for 
example. There are various ways in which this can be done, but one 
common method involves approximating functions such as the natural 
logarithm function, the exponential function or the sine function by 
polynomial functions. 


Recall from Unit 3 that a polynomial function has the form 


f(x) =a sum of terms, each of the form cx’, where k is a 


non-negative integer and c is a constant. 
In other words, it has the form 
f(x) = co tee + coe? +--+ + Cna”, 


where n is a non-negative integer and co, C1, C2,...,Cn are constants 
(possibly 0). The phrase ‘polynomial function’ is often abbreviated to 
polynomial, and we’ll use this abbreviation frequently in this unit. If 
Cn # 0, then the polynomial has degree n. For example, 


f(x) = 6 — 3x + 227 + x’ 
is a polynomial of degree 3. 


To find the value of a polynomial function at a particular input value, the 
only operations that you have to use are addition, subtraction and 
multiplication. For example, the value of the polynomial function f above 
when x = 4 is 


f(4)=6-3x4+2x4 +4 
= 6 — 12 + 32 + 64 = 90. 


By approximating the natural logarithm function by a polynomial 
function, a computer can evaluate ln 3, for example, to the accuracy of the 
computer, using just the operations of addition, subtraction and 
multiplication. Similarly, by approximating the exponential function and 
the sine function by polynomial functions, it can evaluate e!/2 and sin(0.2). 


In this unit you’ll study a particular way of approximating functions by 
polynomials, called Taylor polynomials. By using suitable Taylor 
polynomials, you can approximate many functions to any required level of 
accuracy. In fact, calculators and software packages don’t use Taylor 
polynomials to approximate functions, since more efficient (though also 
more complicated) polynomial methods exist, but by studying Taylor 
polynomials you'll learn about the basic ideas of polynomial 
approximation. 


In most cases a function can’t be approximated by a polynomial function 
over the whole of its domain. What we’ll be interested in throughout the 
unit is the approximation of a function by a polynomial function close to a 
particular point (value) in its domain. 
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Another reason why polynomial approximations are important is that it is 
straightforward to multiply polynomials together, and to differentiate and 
integrate them. Also, polynomial approximations allow complex problems 
to be described by simple mathematical models, making these problems 
easier to understand and to solve. 


Taylor polynomials are also of theoretical importance. They lead naturally 
to a way of representing functions by infinite series, at least for some 
points in their domains. Such representations are called Taylor series. 


In Section 1 you’ll study the approximation of functions by linear and 
quadratic polynomial functions. This is extended to approximation by 
Taylor polynomials of higher degree in Section 2, and then to Taylor series 
in Section 3. Finally, in Section 4, you’ll see various methods for using 
known Taylor series to derive Taylor series for further functions. 


Taylor series and Taylor polynomials are named after the English 
mathematician Brook Taylor, who was educated at home and then at 
St John’s College, Cambridge. In 1715 he published Methodus 
Incrementorum Directa et Inversa, which includes the work on which 
this unit is based, as well as the technique for integration by parts, 
which you studied in Unit 8. The importance of Taylor polynomials 
remained largely unrecognised until much later in the eighteenth 
century. Taylor was elected as a Fellow of the Royal Society in 1712, 
and was appointed to the committee for adjudicating the claims of 
Newton and Leibniz to have invented the calculus. He also wrote 
works on perspective and was a talented musician and artist. 


Taylor was not, in fact, the first person to discover Taylor series. The 
Scottish mathematician James Gregory discovered them more than 
forty years before Taylor, and several other mathematicians, including 
Newton and Leibniz, also independently discovered versions of them 
before Taylor published his work. However, Taylor was the first to 
appreciate their fundamental significance and applicability. 


1 Taylor polynomials of small degree 


James Gregory (1638-1675) 


In this section you'll look at how you can approximate many functions by 
polynomial functions of degrees 0, 1 and 2. 


Here, and throughout the unit, we’ll usually use f to denote a function 
that is to be approximated, and p to denote an approximating polynomial 
function. We’ll usually use a to denote a point in the domain of f close to 
which we want to approximate f. Note that we’ll usually refer to numbers 
in the domain of a function f as points. You met this use of the word 
‘point’ in Unit 6. 
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1.1 Constant Taylor polynomials 


Let’s start by considering how you could approximate a function f, close 
to a particular point a in its domain, by the simplest type of polynomial 
function, namely, a constant function. Remember from Unit 3 that a 
constant function is a function of the form p(x) = c, where c is a 
constant. Its graph is a horizontal line. Approximating a function by a 
constant function is rarely useful, but it illustrates the ideas, and it’s the 
first step in obtaining better approximating polynomials, as you’ll see. 


For example, suppose that you want to approximate the function 

f(x) = sing, close to the point 7/6 in its domain, by a constant function p. 
The best constant function to choose is the one whose graph is the 
horizontal line through the point with x-coordinate 7/6 on the graph of f, 
as illustrated in Figure 1. Since f(m/6) = sin(7/6) = 4, this constant 
function is the function p(x) = 5. 


Yr y = sine 


Figure 1 The graph of the function f(x) = sin x approximated by a 
constant function near z = 77/6 


You can see that, for values of x close to 7/6, the value of p(x) is close to 
the value of f(x). So the value of p(x) can be used as an approximation to 
the value of f(x). The approximation is better when «x is closer to 7/6 
than when it is further away. 


For example, the point 7/4 is fairly close to 7/6, and the approximating 
polynomial p(x) = i gives the following approximation for sin(7/4): 


4 2 
The true value of sin(7/4) is 
T 1 
sin | — ) = — = 0.707 (to 3 d.p.). 


In general, consider any function f that is continuous at a point a in its 
domain. Informally, this means that you can draw the part of the graph 
of f that corresponds to values of x slightly less than a to values of x 
slightly greater than a without taking your pen tip off the paper. Then the 
constant function p that best approximates f close to a is the constant 
function whose graph is the horizontal line though the point (a, f(a)), as 
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illustrated in Figure 2. In other words, it’s the constant function p(x) = c 
where c = f(a). We say that this function p is the constant Taylor 
polynomial about a for f. 


y = f(z) 


YA 


Figure 2 The graph of a function f approximated by a constant function 
near z =a 


For example, you have just seen that the constant Taylor polynomial 


about 7/6 for f(x) = sin x is p(x) = 5. 


If f is any function and p is any approximating polynomial for f, then you 
can obtain an indication of how good the approximation is at any 
particular value of x by subtracting the approximating value p(x) from the 
actual value f(a). The resulting value is known as the remainder at x. 
For example, consider again the function f(x) = sing and the 
approximating polynomial p(x) = 5: If x = 7/4, then the remainder is 


Z) T = 0.207... 


1 
f(z) — p(2) = sin (4) — 5 = 33 
A remainder can be positive, negative or zero, depending on whether f(z) 
is larger than, smaller than, or equal to p(x). Essentially, the remainder is 
the size of the vertical gap between (x, f(x)) and (x, p(x)), with the 
appropriate sign, as illustrated in Figure 3. The smaller the magnitude of 
the remainder, the better the approximation. 


y = f(z) 
YA 
positive remainder 
negative | (x, p(x)) y = p(x) 
remainder | 
| 
| 
| 
| > 
a T T 


Figure 3 Remainders for a function f and an approximating constant 
function p 
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Example 1 Finding and using a constant Taylor polynomial 
(a) Find the constant Taylor polynomial about 0 for the function 


f(z) = e*. 

(b) Use this constant Taylor polynomial to write down 
approximations for e®°! and e®!. In each case, use your 
calculator to find the value of the associated remainder to five 
decimal places. 


Solution 
(a) Since f(0) = e? = 1, the constant Taylor polynomial about 0 for 
(aS ist) — 
(b) The approximation for e°?! given by p is 
p(0.01) = 1, 
with remainder 
f(0.01) = p(0.01) =e"! —1=0.01005 (to5 dp). 
Similarly, the approximation for e°! given by p is 
p(0.01) = 1, 
with remainder 


FPOD=p01 == =O 10517 ico oda! 


The constant Taylor polynomial found in part (a) of Example 1 is shown 
in Figure 4. The results of part (b) of the example illustrate the fact that 
the approximation provided by the constant Taylor polynomial is better 
(has a remainder of smaller magnitude) for values of x closer to 0 than for 
those further from 0. You can also see this from the graph in Figure 4, 
since the gap between the two graphs decreases in magnitude as the value 
of x moves towards 0. 


Figure 4 The graph of f(x) = e” and its constant Taylor polynomial 
about 0 
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In the next activity, and in the later activities in this unit, your calculator 
should be in radian mode when youw’re calculating the values of 
trigonometric functions. We’ll use radians, rather than degrees, 
throughout the unit. This is because we’ll be working with derivatives, and 
the standard formulas for the derivatives of trigonometric functions hold 
only when angles are measured in radians. 


Activity 1 Finding and using constant Taylor polynomials 


(a) Find the constant Taylor polynomial about 0 for the function 
f(x) = cosa. Use this polynomial to write down approximations for 
cos(0.01) and cos(0.1). In each case, use your calculator to find the 
value of the associated remainder to five decimal places. 


(b) Find the constant Taylor polynomial about 1 for the function 
f(x) =Inz. Use this polynomial to write down approximations for 
In(1.01) and In(1.1). In each case, use your calculator to find the value 
of the associated remainder to five decimal places. 


The constant Taylor polynomials from Activity 1 are shown in Figure 5. 


YA YA 


Y = COST 


(a) (b) 


Figure 5 The graphs of two constant Taylor polynomials 


As mentioned earlier, approximating a function f, close to a point a in its 
domain, by a constant Taylor polynomial p is usually not very useful. 
Unless x is extremely close to a, the accuracy of p(x) as an approximation 
for f(x) is not impressive! 


This is illustrated by the fact that none of the approximating constant 
Taylor polynomials in Figures 1, 4 and 5 look particularly close to the 
original functions, as the value of x moves away to either side of the 

point a. However, it could be claimed that the Taylor polynomial in 
Figure 5(a) appears to be a better approximating polynomial than the 
others. In this case the function f has a local maximum at xz = a, and 
hence the tangent to the graph of f at (a, f(a)) is horizontal, and so 
coincides with the graph of the constant Taylor polynomial, p(x). So in 
this case the approximating polynomial p not only has the same value as 
the function f at x = a, but also its graph has the same gradient at z = a. 


1 Taylor polynomials of small degree 


We'll use this idea to obtain better approximating polynomials in the next 
subsection. 


1.2 Linear Taylor polynomials 


In this subsection we’ll continue to look at how we can approximate a 
function f, close to a particular point a in its domain, by a simple 
polynomial function p. In the previous subsection we chose the 
approximating polynomial p to be a constant function; that is, a function 
of the form 


p(x) =c, 


where c is a constant. We chose the value of c to be f(a), to ensure that 
the function f and the approximating polynomial p have the same value as 
each other at x = a. 


Here we'll choose the approximating polynomial p to be a linear 
function, that is, a function of the form 


p(z) =ma+e, 


where m and c are constants. As you know, the graph of such a function is 
a straight line. We’ll choose p to have the property that not only do the 
function f and the approximating polynomial p have the same value 

at x =a, but also their first derivatives have the same value at x = a. The 
second condition ensures that the graphs of f and p have the same 
gradient at x =a. 


In order for this to be possible, the function f must not only be continuous 
at a, but also differentiable at a. If you know that f is differentiable at a, 
then you don’t have to check separately that it’s continuous at a, as that 
follows automatically. This is because, as you saw in Unit 6, if a function 
has a discontinuity at a, then it isn’t differentiable at a. 


The function p obtained as described above is called the linear Taylor 
polynomial about a for f. Its graph is the tangent to f at a, as 
illustrated in Figure 6. Usually a linear Taylor polynomial gives better 
approximations than a constant Taylor polynomial. In some texts, linear 
Taylor polynomials are called tangent approximations. 


YA 


Figure 6 The graph of a function f and its linear Taylor polynomial 
about a 
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To illustrate the ideas, let’s find the linear Taylor polynomial about 0 for 
the exponential function f(x) = e”. We’ll denote this approximating 
polynomial by p, as usual. 


The graph of p is the straight line that passes through the point with 
x-coordinate 0 on the graph of the function f(x) = e”, and has the same 
gradient at that point, as shown in Figure 7. 


YA 
204 


Figure 7 The linear Taylor polynomial about 0 for f(x) = e” 


The point with z-coordinate 0 on the graph of f(x) = e” is (0,e?) = (0,1). 
Also, the derivative of the function f(x) = e” is f'(x) = e”, so the gradient 
of the graph of f at the point (0,1) is f’(0) = e? = 1. So the graph of the 
approximating polynomial p is the straight line that passes through the 
point (0,1) and has gradient 1. From your work in Section 2 of Unit 2, you 
know that the straight line with gradient m that passes through the point 
(21, yi) has equation 


y — yı = m(z — zı), 

so the straight line required here has equation 
w-1)=1(z-0), 

which can be simplified to 
y=1+r. 

Thus the linear Taylor polynomial about 0 for f(x) = e” is 
plz)=1+z. 


You can see from Figure 7 that, as you’d expect, the graph of p(x) =1+2 
approximates that of f(a) = e” near x = 0 more closely than was the case 
for the graph of the constant Taylor polynomial for f(x) = e” about 0 in 
Figure 4. 


As for constant Taylor polynomials, the approximation to f(x) provided 
by p(x) is better for values of x close to 0 than for values of x further away 
from 0, since the gap between the graphs of f and p increases as the value 
of x moves away from 0 on either side. 


1 Taylor polynomials of small degree 


You can use the method above to work out the linear Taylor polynomial 
for any function f about any point a at which its graph has a gradient. 
However, a better way to proceed is to apply the method to a general 
function f and a general point a. This will give a general formula that you 
can use to work out a linear Taylor polynomial in any particular case. 


To do this, let’s suppose that f is a function and a is a point in its domain 

at which it’s differentiable. The point on the graph of f with 

x-coordinate a is (a, f(a)). Also, the gradient of the graph of f at the point 

(a, f(a)) is f'(a). So the graph of p is the straight line that passes through 

the point (a, f(a)) and has gradient f'(a). This straight line has equation 
y — f(a) = f'(a)\(@—a), 


which can be rearranged as 


y = f(a) + f'(a)(z — a). 


So we have the following general formula. 


Linear Taylor polynomials 


Let f be a function that is differentiable at a. The linear Taylor 
polynomial about a for f is 


p(x) = f(a) + f'(a) (x — a). 
When a = 0, this becomes 


p(x) = f(0) + f'(0)x. 


The particular case when a = 0 is stated separately in the box because this 
case occurs commonly and is simpler to work with. 


In the next example the linear Taylor polynomial about 0 for the 
exponential function is worked out again, but this time directly using the 
formula above. The linear Taylor polynomial is also used to find an 


approximation for e®!. 


Example 2 Finding a linear Taylor polynomial about 0 A 

(a) Find the linear Taylor polynomial about 0 for the function — 
f(x) =e". 

(b) Use this polynomial to find an approximation for e°!. Use your 


calculator to find the value of the associated remainder to five 
decimal places. 
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Solution 


(a) ®. Differentiate f to find f’, and hence find the values of f(0) 
and f’(0). © 


We have f(x) = e”, so 
ce 
Hence 
FO) =e =1 ‘and OTe 


@. Apply the second formula in the box above, since in this 
case a = 0. ®@ 


The linear Taylor polynomial about 0 for f(x) = e” is 
p(x) = OE O 
@. Substitute in the values of f (0) and f’(0). ® 
that is, 
PO S= Er 


(b) The approximation for e°! given by the linear Taylor 
polynomial p is 


pO) = ew le 
The remainder for this approximation is 


ec? 1. = 0.005a% (to 5 dp): 


The remainder found in Example 2(b) is about 20 times smaller than the 
remainder 0.10517 found in Example 1(b), where e®-! was approximated 
by a constant Taylor polynomial. So the linear Taylor polynomial about 0 
for f(x) = e” provides a much more accurate approximation for the value 
of e°-! than was obtained using a constant Taylor polynomial. The same 
applies when you approximate e” for any other value of x close to 0. 


In the next activity you’re asked to find a linear Taylor polynomial for the 
sine function. 


Activity 2 Finding a linear Taylor polynomial about 0 


(a) Find the linear Taylor polynomial about 0 for the function 
f(z) Semis: 


(b) Use this polynomial to find approximations for sin(0.25) and sin(0.5), 
each to four decimal places. By comparing these approximations with 
the values obtained from your calculator, show that the magnitude of 
the remainder at x = 0.5 is much larger than that at x = 0.25. 


1 Taylor polynomials of small degree 


Notice from Activity 2(a) that the linear Taylor polynomial about 0 for the 
sine function contains no constant term; it is p(x) = x. This happens 
because the graph of the sine function passes through the origin. The 
graphs of f(x) = sin x and p(x) = x are shown in Figure 8. 


Figure 8 The linear Taylor polynomial about 0 for f(x) = sin x 


Activity 3 Finding another linear Taylor polynomial about 0 


(a) Find the linear Taylor polynomial about 0 for the function 
je) = cose. 


(b) Use this polynomial to find an approximation for cos(0.2), and use 
your calculator to find the value of the associated remainder to four 
decimal places. 


Notice from Activity 3(a) that the linear Taylor polynomial about 0 for the 
cosine function contains no term in x and is therefore a constant function; 
it is p(x) = 1. This happens because the graph of the function f(x) = cos x 
has gradient zero at the point where x = 0. The graphs of f(x) = cosx 
and p(x) = 1 are shown in Figure 9. 


YA 


| 
2 
—]l-4 


Figure 9 The linear Taylor polynomial about 0 for f(x) = cos x 
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In fact, the linear Taylor polynomial p in this case is the same as the 
constant Taylor polynomial about 0 for the cosine function, which you 
found in Activity 1(a) and whose graph is in Figure 5(a). In Activity l(a), 
you started by looking for (and finding) a constant function, whereas in 
Activity 3(a) you started by looking for a linear function, which turned out 
to be one in which the coefficient of x is zero. 


In the next activity you’re asked to use a linear Taylor polynomial to 
approximate a square root. 
Activity 4 Finding and using another linear Taylor polynomial 


(a) Show that the linear Taylor polynomial about 0 for the function 
f(z) = (1+ z)"?, 


p(z) =1+ iz. 


(b) Use the polynomial p from part (a), with x = 0.01, to find an 
approximate value for v1.01. Use your calculator to find, to six 
decimal places, the value of the associated remainder. 


Linear Taylor polynomials about a Æ 0 


All the linear Taylor polynomials that you’ve seen so far in this subsection 
have been about 0. The next example and activity relate to linear Taylor 
polynomials about another point. 


1 Taylor polynomials of small degree 


The graphs of the function f(x) = Ina — 1/x and the linear Taylor 
polynomial p(x) = —3 + 2x that was found in Example 3 are shown in 
Figure 10. You can see that p(x) is an approximation to f(x) for values 
of x close to 1. 


Figure 10 The linear Taylor polynomial about 1 for f(x) = Ina — 1/x 
Activity 5 Finding a linear Taylor polynomial about a point other 


than 0 


Find the linear Taylor polynomial about 1 for the function f(x) = e”. 
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In Activity 5 you were asked to obtain the linear Taylor polynomial 

about 1 for the function f(a) = e”, while in Example 2(a) on page 105 the 
linear Taylor polynomial about 0 was obtained for the same function. The 
graphs of these linear Taylor polynomials are shown in Figure 11. As you’d 
expect, it appears that the first of these polynomials approximates e” for 
values of x close to 1, while the second approximates e” for values of x 
close to 0. This illustrates that, in general, Taylor polynomials about 
different points are different polynomials. 


Yr ye 


ax 


Figure 11 The linear Taylor polynomials about 0 and 1 for f(x) =e 


In Subsection 1.1 we approximated functions by constant Taylor 
polynomials, and in this subsection we approximated them by linear 
Taylor polynomials. The linear Taylor polynomials usually provided better 
approximations than the constant Taylor polynomials. This suggests that 
we could obtain further improvements by increasing the degree of the 
approximating polynomial once more, and trying to approximate functions 
by polynomials of degree 2, which are quadratic functions. This is the 
topic of the next subsection. 


1.3 Quadratic Taylor polynomials 


We now look at approximating functions by quadratic functions. As you’d 
expect, this usually gives greater accuracy than approximating functions 
by linear functions. 


Suppose that f is a function that’s differentiable at a. In Subsection 1.2 
you saw how to approximate f close to a by a linear function p. The 
particular linear function p was chosen to ensure that the following two 
conditions hold: 


1. The values of the function and the approximating polynomial are 
equal at a; that is, p(a) = f(a). 


2. The values of the first derivatives of the function and the 
approximating polynomial are equal at a; that is, p'(a) = f'(a). 


1 Taylor polynomials of small degree 


Suppose that we now want to try to approximate f close to a by a 
quadratic function p. As you know, a quadratic function is a function of 
the form 


p(z) =cot ae + CX", 


where co, Cc, and cg are constants. However, in the context of Taylor 
polynomials, it’s more convenient to write the general form of an 
approximating quadratic function as 

p(x) = cy + a(x — a) + c2(2 — a)’, 
where cg, c1 and c2 are constants, and a is the point about which we want 
to find the approximating function. This alternative form is the rule of a 
quadratic function, since if you multiply out the right-hand side then you 
obtain powers of x up to and including x”, but no higher powers. You can 


see that the form above reduces to the more usual form in the case when 
a= 0. 


It seems sensible to choose the approximating quadratic function p to 
ensure that conditions 1 and 2 above hold. Since there are now three 
constants to choose, we can also impose a third condition, and a natural 
one to choose is: 


3. The values of the second derivatives of the function and the 
approximating polynomial are equal at a; that is, p’(a) = f” (a). 


We can impose this condition provided that f”(a) exists; that is, provided 
that f is twice differentiable at a. This is the case for many functions and 
many points in their domains. In fact, many functions can be 
differentiated as many times as you wish at all points in their domains. 
Such functions include all polynomial, rational, trigonometric, exponential 
and logarithmic functions, and all constant multiples, sums, differences, 
products, quotients and composites of these. 


As you've seen, condition 1 means that the graphs of the function f and 
the approximating polynomial p both pass through the same point 

(a, f(a)), and condition 2 means that the graphs of the function and the 
approximating polynomial both have the same gradient at that point. 
Condition 3 means that the function and the approximating polynomial 
also have the same rate of change of gradient at that point. Roughly 
speaking, this means that their graphs have the same ‘curvature’ at that 
point. 


The polynomial p of the form above that satisfies conditions 1, 2 and 3 is 
called the quadratic Taylor polynomial about a for f. For any 
point x close to a, the value of p(x) is an approximation for f(x). 


In some cases, the polynomial p(x) that satisfies conditions 1, 2 and 3 has 
co = 0 and so is not a quadratic polynomial, but has degree 1 or less. If 
this happens, then we still refer to the approximating polynomial as the 
quadratic Taylor polynomial about 0 for f. This means that a quadratic 
Taylor polynomial is not necessarily a quadratic polynomial! You'll see an 
example of this later in this section. You saw in Subsection 1.2 that a 
similar situation arises with linear Taylor polynomials (a linear Taylor 
polynomial can be a constant function). 
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As for linear Taylor polynomials, there’s a general formula that you can 
use to find quadratic Taylor polynomials. It’s given in the box on 

page 114. If you’re not interested in knowing where the formula comes 
from, then you can skip ahead to this box. Otherwise, keep reading! 


To illustrate the ideas of how the formula is derived, let’s start by finding a 
particular quadratic Taylor polynomial, namely the quadratic Taylor 
polynomial about 0 for the exponential function f(x) = e”. As discussed 
above, we can take it to be of the form 


p(x) = co + c1(a — a) + ¢2(a — a)’, 
where co, &1 and c2 are constants. In this case, since a = 0, it reduces to 
p(x) = co +a + cnx. 


We have to determine what the values of the constants co, cy and cg must 
be to ensure that the value of p at 0, and the values of the first and second 
derivatives of p at 0, are the same as those of f. 


The first and second derivatives of the function f(a) = e” are f'(x) = e” 


and f(x) = e”. Hence 
fO)=e=1, f(0)=—e=1 and f(O =e =1. 


So we have to choose the values of the constants cg, cj and c2 to ensure 
that 


p(0)=1, p'(0)=1 and p”(0)=1. 
Here’s how we can do that. 
First we ensure that p(0) = 1. We have 
p(z) = co + c12 + er’, 
so p(0) = co. Thus to ensure that p(0) = 1 we must have co = 1. 
Next we ensure that p'(0) = 1. Differentiating the formula for p gives 
p (x) = & + 2coz, 
so p'(0) = cı. Thus to ensure that p’(0) = 1 we must have cı = 1. 
Finally we ensure that p’(0) = 1. Differentiating the formula for p’ gives 
p(x) = 2c, 


so p” (0) = 2c2. Thus to ensure that p”(0) = 1 we must have 2c = 1; that 
. 1 
is, Co = 5. 


So the quadratic Taylor polynomial p for f(x) = e” about 0 is 
p(z) =14+a+ ła’. 


The graphs of f(x) = e” and the approximating polynomial 

p(x) = 1 +x + $2 found above are shown in Figure 12. You can see that 
the quadratic function p appears to be a more accurate approximating 
polynomial for f(x) = e” for values of x close to 0 than the linear function 
found earlier, which is shown in Figure 7 on page 104. 
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Figure 12 The quadratic Taylor polynomial about 0 for f(x) = e” 


As with the constant and linear Taylor polynomials about 0 for f(x) = e” 
that you met earlier, which are p(x) = 1 and p(x) = 1+ 2, respectively, the 
quadratic Taylor polynomial p(x) = 1+ £ + ir? provides better 
approximations for values of x close to 0 than for values of x further away. 


You could use the method demonstrated above to work out the quadratic 
Taylor polynomial for any function f about any point a at which it is twice 
differentiable. However, let’s instead apply the method to a general 
function f and general point a. This will give a general formula that we 
can use to work out a quadratic Taylor polynomial in any particular case. 


So let’s suppose that f is a function and a is a point in its domain at 
which it is twice differentiable. 


The quadratic Taylor polynomial about a for f is of the form 
p(x) = co + a (z — a) + c2(z — a)’, 


where co, cj and cg are constants. We have to determine what the values of 
the constants co, c1 and cp must be to ensure that the value of p at a, and 
the values of the first and second derivatives of p at a, are the same as 
those of f. 


To do this, we apply the method demonstrated above, the only difference 
being that we can’t evaluate the quantities f(a), f'(a) and f”(a), so 
instead we keep them in their general form throughout our working. 
They’ll then appear in the final formula for p, ready to be evaluated for 
any particular function f and point a. 


As before, we have to choose the values of cp, cı and c2 to ensure that 
pla) = f(a), p'(a)= f(a) and p”(a)= f"(a). 

To do that, first we ensure that p(a) = f(a). We have 
p(z) = co + c (z — a) + (a — a)’, 

so p(a) = co. Thus to ensure that p(a) = f(a) we must have co = f(a). 
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Next we ensure that p'(a) = f'(a). The first step here is to differentiate the 
formula for p, 


p(x) = co + a (z — a) + co(x — a)’. 


The first term, co, is a constant, so its derivative is 0. To differentiate the 
second term, cı(x — a), we use the constant multiple rule, which gives c1. 
To differentiate the third term, c2(a — a)?, we use the constant multiple 
rule and the chain rule (or the rule for differentiating a function of a linear 
expression), which gives 2c2(x — a). The final answer is 

p(x) = c + 2c9(x — a). 
Hence p'(a) = ci. Thus to ensure that p'(a) = f'(a) we must have 
cı = f'(a). 
Finally we ensure that p”(a) = f”(a). Differentiating the formula for p’, 
using a method similar to that used to differentiate the formula for p 
above, gives 

p" (£) = 269, 
Hence p”(a) = 2c2. Thus to ensure that p”(a) = f”(a) we must have 
2c = f" (a); that is, c2 = 5f"(a). 


So we have the following general formula. 


Quadratic Taylor polynomials 


Let f be a function that is twice differentiable at a. The quadratic 
Taylor polynomial about a for f is 


pa) = fla) + f@e—4) +5f' @e-a). 
When a = 0, this becomes 
p(x) = f(0) + f'(O)a+ $f" (O)z?. 


This formula allows you to find the quadratic Taylor polynomial p about a 
for any suitable function f. To do this, you calculate the value of f at a, 
and the values of the first and second derivatives of f at a, and then 
substitute them into the formula. 


In the next example the quadratic Taylor polynomial about 0 for the 
exponential function is worked out again, this time directly using the 
formula in the box above. 


Example 4 Finding a quadratic Taylor polynomial about 0 


Find the quadratic Taylor polynomial about 0 for the function 
O= e 


1 Taylor polynomials of small degree 


You can see that the quadratic Taylor polynomial about 0 for the function 
f(x) = e” found in Example 4 is the same as that found earlier in this 
subsection, as expected. 


In the next two activities you can use the second formula in the box on 
page 114 to find the quadratic Taylor polynomials about 0 for the cosine 
and sine functions. 


Activity 6 Finding a quadratic Taylor polynomial about 0 


(a) Find the quadratic Taylor polynomial about 0 for the function 
(2) = cos z. 


(b) Use this polynomial to find an approximation for cos(0.2), and use 
your calculator to find the value of the associated remainder to six 
decimal places. Compare this approximation for cos(0.2) with the one 
found in Activity 3(b). Which is better? 


The outcome of Activity 6(b) demonstrates that increasing the degree of 
the approximating Taylor polynomial from 1 to 2 can significantly increase 
the accuracy of an approximation at a particular point. 
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The graphs of f(x) = cosx and the quadratic Taylor polynomial 

p(x) =1—- ir? found in Activity 6(a) are shown in Figure 13. You can see 
that p(x) appears to be a good approximation for f(x) for values of x in 
quite a large interval around 0. 


YA 


1 
l l T 
=9 2 
=H 
il ig? 


Figure 13 The quadratic Taylor polynomial about 0 for f(x) = cos æ 


Activity 7 Finding another quadratic Taylor polynomial about 0 


Find the quadratic Taylor polynomial about 0 for the function 


f(x) = sinz. 


The quadratic Taylor polynomial about 0 for the sine function, p(x) = x, is 
an example of a quadratic Taylor polynomial that is not a quadratic 
polynomial. The quadratic Taylor polynomial about 0 for the sine function 
is the same as the linear Taylor polynomial about 0 for the sine function, 
whose graph is shown in Figure 8 on page 107. 


We have now found constant, linear and quadratic Taylor polynomials 
about 0 for each of the functions f(x) = e”, f(x) = sinx and f(x) = cos z. 
These are listed in Table 1. 


Table 1 Constant, linear and quadratic Taylor polynomials about 0 


Function Constant Linear Quadratic 


ev 1 1l+a 1+a4 52? 
sin x 0 g x 
COS £ 1 1 1— $2? 


Notice that, for each of these three functions, the linear Taylor polynomial 
about 0 can be obtained from the constant Taylor polynomial about 0 by 
adding the appropriate term in x (this term is Ox in the case of cos). 
Similarly, the quadratic Taylor polynomial about 0 can be obtained from 
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the linear Taylor polynomial about 0 by adding the appropriate term in x? 


(this term is 0x? in the case of sin). You will see in Section 2 that these 
properties hold for every function f for which these Taylor polynomials 
can be found, and that similar properties hold for higher-degree Taylor 
polynomials about 0. When you’re calculating Taylor polynomials, this is 
a very convenient feature. 


Quadratic Taylor polynomials about a + 0 


So far in this section, you’ve seen quadratic Taylor polynomials only 
about 0. The next example and activity involve quadratic Taylor 
polynomials about another point. 


Example 5 Finding a quadratic Taylor polynomial about a point A 
other than 0 — 
Find the quadratic Taylor polynomial about 1 for the function 

ie) = ing: 

Solution 

@. Differentiate f twice to find f’ and f”, and find the values of f(1), 

f'(1) and y # 

We have f(z) = ln z, so 


1 1 n 1 
Aon. and ars 
Hence 
A) =n, = - = æd j'()= = =-1. 


@. Apply the first formula in the box on page 114. Remember that in 
this case a = 1. ® 


The quadratic Taylor polynomial about 1 for f(x) = ln z is 
2a) — at Oe) > Vay: 
® Substitute in the values of f(1), f’(1) and f”(1). @ 
that is, 
p(w) =0+1@—-1)+5(-)@-1)’, 
which can be simplified to 
DG) = a a 


@. Usually, leave the answer in this form, rather than multiplying 
out. ® 
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You could simplify the quadratic Taylor polynomial found in Example 5 by 
multiplying out the squared brackets and collecting like terms. This gives 


ps) = —3 + 2a — la’. 


However, normally we don’t simplify quadratic Taylor polynomials in this 
way. Instead, we leave them in the form 

p(x) = co +(x — a) + c&(zx — a)”. 
To understand why, remember that a convenient property of Taylor 
polynomials about 0 is that you can obtain the linear Taylor polynomial 
about 0 for a function f by adding the appropriate term in x to the 
constant Taylor polynomial about 0 for f, and similarly for higher-degree 
Taylor polynomials. Taylor polynomials about a point a other than 0 have 
the same property, as long as you use the form above for the Taylor 
polynomials; that is, as long as you consider terms in powers of x — a 
rather than terms in powers of x. You’ll see in the next section why this is 
true for any value of a. 


The graphs of the function f(x) = Inz and the quadratic Taylor 
polynomial p(x) = (x — 1) — 4(a — 1)? found in Example 5 are shown in 


2 
Figure 14. 


Yh y= lnr 


y = p(z) 


Figure 14 The quadratic Taylor polynomial about 1 for f(x) = ln z 


Activity 8 Finding a quadratic Taylor polynomial about a point 
other than 0 


£ 


Find the quadratic Taylor polynomial about 1 for the function f(x) = e”. 


So far you’ve seen how you can use constant, linear and quadratic Taylor 
polynomials about a point a for a function f to approximate the values 
of f close to a. You saw that when we increased the degree of the Taylor 
polynomial the accuracy of the approximations was usually improved. In 
the next section yov’ll see that this fact generalises to approximating 
polynomials of higher degree. 


2 Taylor polynomials of any degree 


2 Taylor polynomials of any degree 


In this section you'll look at approximating functions by polynomials of 
any degree, and how you can use such polynomials to find approximate 
values for functions at particular points. 


2.1 Taylor polynomials of degree n 


From what you saw in Section 1, you might guess that for any suitable 
function f and any point a in its domain, you can obtain more and more 
accurate approximating polynomials for f close to a by taking polynomials 
of higher and higher degrees, and choosing the coefficients to ensure that 
the values of higher and higher derivatives of the polynomial at a are the 
same as those of the corresponding derivatives of f. Here ‘suitable 
function f’ means that f must be differentiable at a the required number 
of times. 


For example, to try to improve on the approximations provided by the 
quadratic Taylor polynomial about a for f, you could attempt to 
approximate f by a cubic function, of the form 


p(x) = co + a (x — a) + c2(x — a)? + 3(a@ — a)’, 


whose value is the same as that of f at a, and whose first, second and third 
derivatives have the same values at a as the corresponding derivatives of f. 


More generally, for any chosen value of n, you could try to find a 
polynomial 


p(x) = co +c (x — a) + co(@ — a)? + c3(£ — a)? +--+ cnl — a)”, (1) 


whose value is the same as that of f at a, and whose first, second, third, 
..., nth derivatives have the same values at a as the corresponding 
derivatives of f. The polynomial that satisfies these conditions is called 
the Taylor polynomial of degree n about a for f. 


There’s a general formula for this Taylor polynomial, for any function f, 
any degree n, and any point a at which f is differentiable n times. As 
you’d expect, this formula involves the values of the first, second, third, 
..., nth derivatives of f at a. Remember that for n > 3 the nth derivative 
of the function f is denoted by f™), so the value of the nth derivative of f 
at a is denoted by f(a). (The third derivative f(3) is sometimes denoted 
by f’”.) The general formula is given in the box on page 121. If you’re not 
interested in knowing how it’s derived, then you can skip ahead to it. 
Otherwise, as before, keep reading. 


To find the formula, we’ll use the method that was used for quadratic 
polynomials in the previous section, but we’ll continue with derivatives up 
to the nth, rather than just the second. Similarly to before, we can’t 
evaluate the first, second, third, ..., nth derivatives of f at a, because we 
don’t know what f and a are, so throughout our working we’ll denote 
them by f'(a), f’(a), f(a), ..., f™ (a). These quantities will then 
appear in the final formula for p, ready to be evaluated for any particular 
function f and point a. 
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Here’s how the working goes. We need to determine what the values of the 
constants Cp, C1, C2, ---,; Cn in equation (1) must be to ensure that the 
value of p at a, and the values of the first, second, third, ..., nth 
derivatives of p at a, are the same as those of f. 


First we ensure that p(a) = f(a). We have 
p(x) = co + c& (zx — a) + co(2 — a)? + c3(x — a)? + c4(x — a)* 
+ eale a)”, 
so p(a) = co. Thus to ensure that p(a) = f(a) we must have 
co = f(a). 
Next we ensure that p'(a) = f'(a). We have 
p (x) = cy + 2c2(x — a) + 3c3(x — a)? + 4e4(2 — a)? 


bee 4+ nen(s — a)", 


so p'(a) = cy. Thus to ensure that p'(a) = f'(a) we must have 
cı = f'(a). 
Then we ensure that p”(a) = f” (a). We have 
p(x) = 2c2 +3 x 2c3(x — a) + 4 x 3c4(z — a) 
$e T a), 


2 


so p” (a) = 2c2. Thus to ensure that p”(a) = f(a) we must have 
2co = f(a); that is, 
co =4f"(a). 
It’s useful to write this as 
_ f"@) 
oF 
where 2! denotes 2 x 1, that is, 2 factorial, as you saw in Unit 10. This 


allows the pattern in the next few calculations to be seen more easily. 
(Recall that if n is a positive integer, then, by definition, 


C2 


n! =n(n—1)x---x2x 1.) 
Then we ensure that p®) (a) = f) (a). We have 
p® (£) = 3 x 2c3 +4 x 3 x 2c4(£ — a) 
+- + n(n- 1)(n — 2)en (z — a)’, 


so p) (a) = 3!c3. Thus to ensure that p*(a) = f?(a) we must have 
3!c3 = f(a); that is, 


(3) 
n- PO 
3! 
Continuing in this way, we find that we must have 
(4) (5) 
C4 = j _ — j e and so on, 
until finally, to ensure that p™ (a) = f(a), we must have 
(n) 
„PO 
n! 
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The resulting formula for the approximating polynomial is stated below. 
You can see that it generalises the formulas for constant, linear and 
quadratic Taylor polynomials given earlier. In the case where a = 0 the 
formula reduces to a simpler form, which is also stated below. 


Taylor polynomials 


Let f be a function that is n-times differentiable at a point a. The 
Taylor polynomial of degree n about a for f is 


Mag, B3)(q 
pa) = f(a) + Poe- a) + 29 (@ — a)? + EY (ea) 
na 
P ig ge (2) 


n! 
The point a is called the centre of the Taylor polynomial. 


When a = 0, the Taylor polynomial above becomes 


A (3) (n) 
o S 24, O94...4F°O) an (3) 


Formula (2) allows you to find the Taylor polynomial p of degree n about a 
for any suitable function f. To do this, you calculate the value of f at a, 
and the values of the first, second, third, ..., nth derivatives of f at a, and 
substitute them into the formula. 


The formula also confirms that any Taylor polynomial about a for a 
function f can be obtained from a Taylor polynomial of lower degree 
about a for f by adding the appropriate further terms. For example, the 
Taylor polynomial of degree 1 about a for f is 


p(z) = fla) + f'(a)(z — a), 
while the Taylor polynomial of degree 2 about 0 for f is 


p(x) = f(a) + Foe- a) + 2 (ea), 


The second polynomial is obtained from the first simply by adding the 
term in (x — a)’. 


For some functions f, the value of f™ (a) is 0. In such a case, formula (2) 
gives a polynomial whose degree is less than n. If this happens, then the 
polynomial is still called the Taylor polynomial of degree n about a for f. 
This means that a Taylor polynomial of degree n is not necessarily a 
polynomial of degree n. You’ve seen examples of this already, in the cases 
n = 1 and n = 2, and you'll see more examples later in this section. 


You’ve seen the terms constant, linear and quadratic used to describe 
Taylor polynomials of degrees 0, 1 and 2, respectively. The terms cubic, 
quartic and quintic are used to describe Taylor polynomials of 

degrees 3, 4 and 5, respectively. 


The next example and the two activities that follow concern Taylor 
polynomials about 0. So in these we use formula (3) for a Taylor 
polynomial about 0, rather than the general formula (2). 
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Example 6 Finding a quartic Taylor polynomial about 0 
Find the quartic Taylor polynomial about 0 for the function 
f(a) =e. 
Solution 
@. Repeatedly differentiate f to find f’, f", fO) and f™, and find 
the values of f(0), f’(0), f’(0), f(0) and f“(0). @ 
Here f(0) = e? = 1. Also, for each positive integer n, the nth 
derivative of the function f(x) = e” is f™ (x) = e”, so 
f'(0) =e° =1, forall positive integers n. 


@. Apply formula (3) in the box above, since in this case a = 0. Also, 
ASe lsn Ol and Al e 


Hence, by formula (3), the Taylor polynomial of degree 4 about 0 for 
jee" 16 


1 
p(n) =1+a+ 50° + : 


1 3 1 ` 
aa vpo 


that is, 
p(t) =1+gr + 4x? + 4r’ + Hart. 


Notice that, as you’d expect, the first three terms found in Example 6 
agree with those of the quadratic Taylor polynomial about 0 for the 
exponential function, which was found in Example 4. 


The graphs of the function f(x) = e” and the quartic Taylor polynomial 
about 0 found in Example 6 are shown in Figure 15. Notice how much 
better this approximating polynomial is than the quadratic one shown in 
Figure 12 on page 113. 


Figure 15 The quartic Taylor polynomial about 0 for f(x) = e” 


2 Taylor polynomials of any degree 


Since for the function f(x) = e” we have that f™ (0) = 1 for all positive 
integers n, it follows from formula (3) that, for any n, the Taylor 
polynomial of degree n about 0 for f(x) = e” is 
1 1 1 
a ae a E eae 
In the next activity you’re asked to find the quartic Taylor polynomials 
about 0 for the cosine and sine functions. 


Activity 9 Finding quartic Taylor polynomials about 0 


Find the quartic Taylor polynomial about 0 for each of the following 
functions. 


(a) f(x) = cos z (b) f(x)= sing 


Notice that the constant term, the term in x and the term in z? in the 
Taylor polynomials that you were asked to find in Activity 9 are the same 
as those in the quadratic Taylor polynomials about 0 for the cosine and 
sine functions, which you were asked to find in Activities 6 and 7. 


The quartic Taylor polynomial about 0 for the sine function is an example 
of a Taylor polynomial of degree n whose polynomial degree is less than n. 


The graphs of the cosine and sine functions, and the quartic Taylor 
polynomials for these functions that you were asked to find in Activity 9, 
are shown in Figure 16(a) and (b), respectively. 


7 
[SOE] 
Rv 


U = COST 


(a) (b) 


Figure 16 The quartic Taylor polynomials about 0 for cosine and sine 
In the next activity you’re asked to find a general formula for a Taylor 


polynomial of degree n about 0 for a particular function, by spotting how 
the pattern of derivatives develops. 
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Activity 10 Finding a Taylor polynomial of degree n 


Consider the function 


l-xc 
Use the chain rule (from Unit 7) to show that, if k is a constant, 
then 


d 1 7 k 

dz \(l-—a)F/ (L-g) 
Use the result of part (a)(i) to find formulas for f’, f" and f®), 
and hence find the values of f’(0), f”(0) and f®)(0). 
Hence find the cubic Taylor polynomial about 0 for f. 


Use the result of part (a)(i) to find successive derivatives of f 
beyond the third, until the pattern is clear and you can write 
down a formula for the nth derivative of f, in terms of x and n. 


Hence write down a formula in terms of n for f™ (0), the value of 
the nth derivative of f at 0. 


Hence write down, using the +---+ notation, a formula for the 
Taylor polynomial of degree n about 0 for f. 


The graph of the function f(x) = 1/(1 — x), and the graph of the cubic 
Taylor polynomial about 0 for this function from Activity 10(a), are shown 
in Figure 17. 


y=1+r +r? +r’ 


Figure 17 The cubic Taylor polynomial about 0 for f(x) = 1/(1 — x) 
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Even and odd functions 


In Activity 9 earlier in this subsection you saw that the quartic Taylor 
polynomials about 0 for the cosine and sine functions are 
1,2, 1,4 1,3 
l-32 +530 and z=- gt” 
respectively. 


Notice that the quartic Taylor polynomial about 0 for the cosine function 
contains terms in even powers of x only, whereas that for the sine function 
contains terms in odd powers of x only. These observations are explained 
by the facts that the cosine function is an even function and the sine 
function is an odd function, as defined in general below. 


A function f is said to be even if its graph is unchanged under reflection 
in the y-axis, as illustrated in Figure 18(a). Thus f is even if 


f(—x)= f(x), for all x in the domain of f. 


Similarly, a function f is said to be odd if its graph is unchanged by 
rotation through a half turn about the origin, as illustrated in 
Figure 18(b). Thus f is odd if 


f(—x)= -—f(x), for all x in the domain of f. 


A rotation through a half turn about the origin has the same effect as a 
reflection in the y-axis followed by a reflection in the x-axis. Hence the 
graph of an odd function is the same as an ‘upside down’ reflection of itself 
in the y-axis. 


(a) (b) 


Figure 18 (a) An even function (b) An odd function 


These definitions are summarised below. 


Even and odd functions 
A function f is 


e even if f(—x) = f(x) for all x in the domain of f 
e odd if f(—x) = —f (2) for all x in the domain of f. 
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Here are some examples. 
e =f (x) = 2? is an even function, since 
f(—a) = (=r)? = r? = f(z), forall ceER. 
e = f(x) = 2° is an odd function, since 
f(—z) = ( r)’ = -g3 = f(z), foralzrzeEeR. 
e f(x) =sinz is an odd function, since 


f(-x) = sin(-z) = — sing = — f(x), forall z ER. 


e f(x)= cosx is an even function, since 
f(—x) = cos(—x) = cosx = f(x), forall z ER. 


e f(x)= e” is neither even nor odd, since we can find a value of x, 
say x = 1, such that e~* # e” and e7” 4 —e® . (In fact, ec! ~ 0.368 
and e! ~ 2.718.) 


(Remember that the symbol € means ‘in’ or ‘belongs to’ and that the 
symbol R denotes the set of real numbers.) 


Taylor polynomials about 0 for even and odd functions have the following 
properties. 


Taylor polynomials about 0 for even and odd functions 


A Taylor polynomial about 0 for an even function contains terms in 
even powers of x only. 


A Taylor polynomial about 0 for an odd function contains terms in 
odd powers of x only. 


Here’s an explanation of why these facts are true. 


By looking at the symmetry of the graph in Figure 18(a), and thinking 
about even functions in general, you can see that if f is an even function, 
then for all x in its domain the gradient of the graph of f at —x has the 
same magnitude as the gradient at x, but the opposite sign (provided that 
the gradient exists). In other words, 

if f(—x) = f(x), for all x in the domain of f, 

then f’(—x) =- f'(x), for all x in the domain of f’. 


This means that if f is an even function then f’ is an odd function. 


Similarly, by looking at the symmetry of the graph in Figure 18(b) and 
thinking about odd functions in general, you can see that if f is an odd 
function, then for all x in its domain the gradient of f is the same at — x as 
at x (provided that the gradient exists). In other words, 


if f(—x) = — f(x), for all x in the domain of f, 
then f’(—2x) = f'(x), for all x in the domain of f’. 


This means that if f is an odd function then f’ is an even function. 
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So differentiation turns any even function into an odd function, and vice 
versa. 


Notice also that if f is an odd function whose domain contains 0, then, 
since f(—x) = — f(x) for all x in its domain, we have in particular that 
f(0) = —f(0), and it follows that f(0) = 0. Hence any odd function has 
the value 0 at 0. 


Now let f be any even function that’s differentiable infinitely many times 
at 0. It follows from the discussion above that f”, f®, f©, ..., are all 
even functions, and f’, f®, f©, ..., are all odd functions. Since any odd 
function has the value 0 at the point 0, the values of f’(0), f)(0), f©(0), 
..., are all 0. Hence, from the general formula for a Taylor polynomial 
about 0, any Taylor polynomial about 0 for f contains only terms with 
even powers of x. 


Similar reasoning applies in the case of an odd function. 


By the facts in the box above, any Taylor polynomial about 0 for the 
cosine function contains terms in even powers of x only, and any Taylor 
polynomial about 0 for the sine function contains terms in odd powers of x 
only. For example, the Taylor polynomial of degree 9 about 0 for 

f(x) =cosz is 


il 1 1 1 
plx)=1 r? + rt rÊ + — 98, 


2! 4! 6! 8! 
Similarly, the Taylor polynomial of degree 9 about 0 for f(x) = sin is 
1 1 1 1 
pig) =z a i ar ar i ~ 


You can find these Taylor polynomials by extending the solution to 
Activity 9, and noting the patterns in the coefficients of the Taylor 
polynomials. These patterns continue indefinitely for higher-degree Taylor 
polynomials for the cosine and sine functions. You’ll be asked to verify this 
later in the unit. 


Taylor polynomials of degree n about a ¥ 0 


So far in this section, all the examples and activities have involved Taylor 
polynomials about 0. The next example and activity involve Taylor 
polynomials about another point a. So here we use the general formula for 
a Taylor polynomial given by equation (2) on page 121. 
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A Example 7 Finding a quartic Taylor polynomial about a point 
— other than 0 
Find the quartic Taylor polynomial about 1 for the function 
Cc) = la. 
Solution 
@. Repeatedly differentiate f to find f’, f”, f® and f™, and find 
the values of f(1), f’(1), F"), FO) and f(1). A 


The first four derivatives of the function f(x) = ln z are as follows: 


1 il 2 
ræ T m f" (x) T o f(z) = mo’ 
3x2 3! 
S 7 eee 


@. The product 3 x 2 has been simplified to 3! rather than 6 here to 
highlight the emerging pattern. ® 


So 

OD) =U, 1 FO 

PM 
2. a formula (2) on page 121. Remember that in this case 
@ = Il, J 


Hence the quartic Taylor polynomial about 1 for the function 
Pao) = lme i 


plz) =0+1x (2-1) + LY @ 1+ 2 (e198 


that is, 


As you saw earlier in the case of quadratic Taylor polynomials, we usually 
leave a Taylor polynomial about a point a as a sum of terms each of which 
is the product of a constant and a power of x — a, rather than multiplying 
it out. However, we might make an exception for a polynomial of low 
degree for which the multiplied-out form is simpler. 
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The graphs of the function f(x) = lnx and the quartic Taylor polynomial 
about 1 found in Example 7 are shown in Figure 19. As you’d expect, the 
quartic Taylor polynomial appears to be a better approximating 
polynomial than the quadratic Taylor polynomial about 1 for f shown in 
Figure 14 on page 118. 


Figure 19 The quartic Taylor polynomial about 1 for f(x) = ln æ 


If, in the solution to Example 7, you look at the patterns in the formulas 
for f, f’, f", f®, ..., and hence in the values of f(1), f’(1), f”(1), 
f®A), ..., then you can see that the pattern of terms in the Taylor 
polynomial found in the example continues as the degree of the Taylor 
polynomial increases. So, for any positive integer n, the Taylor polynomial 
of degree n about 1 for f(x) = ln z is 


p(n) = (e-1)— He 1)? + Aw — 1)? — + (11S (e - 1)" 


The expression (—1)"~! in the final term here is, as you saw in Unit 10, a 
neat way to give the term a negative sign when n is even and a positive 
sign when n is odd. This expression can also be written as —(—1)", since 


eis ise e 


Activity 11 Finding a cubic Taylor polynomial about a point other 
than 0 


Find the cubic Taylor polynomial about 7/6 for the function f(x) = sin z. 


The graphs of the function f(x) = sin x and the cubic Taylor polynomial 
about 7/6 found in Activity 11 are shown in Figure 20. 
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Figure 20 The cubic Taylor polynomial about 7/6 for f(x) = sin x 


The cubic Taylor polynomial about 7/6 for the function f(x) = sinz, 
which you were asked to find in Activity 11, contains terms in (a — 7/6)* 
with k even as well as with k odd. This is not surprising, since the earlier 
discussion about even and odd functions applies only to Taylor 
polynomials about 0. 


Activity 12 Investigating graphs of Taylor polynomials 


Open the Graphs of Taylor polynomials applet. Initially it shows the 
graphs of the function f(x) = e” and its Taylor polynomial of degree 1 
about 0. 


(a) Increase the degree n of the Taylor polynomial and observe the effect 
on its graph. 


(b) Try changing the function f and then varying n. 
(c) Try changing the centre a of the Taylor polynomial and then varying n. 


(d) Try some other functions and other centres for the Taylor polynomial. 


Using sigma notation for Taylor polynomials 


Sigma notation for series, which you met in Unit 10, provides a concise 
way to write down Taylor polynomials. Formula (3) on page 121, for the 
Taylor polynomial of degree n about 0 for a function f, is 


1 (3) (n) 
4 Lge U a 


p(x) = f(0) + f'(0) 


2 Taylor polynomials of any degree 


This polynomial can be written in sigma notation as 


n p(k) 
ra) = o£ ua g". 


k=0 


(Here f is interpreted to mean f itself. Also, by convention, 0° is taken 
to have the value 1 in series of this type. Recall also that 0! = 1.) 


For example, the quartic Taylor polynomial about 0 for the function 
f(x) = e”, which was found in Example 6, is 


4 


You can often express particular Taylor polynomials concisely in this way, 
once the pattern of terms is clear. 


2.2 Taylor polynomials for approximation 


In this subsection we’ll use Taylor polynomials to calculate approximations 
for values of functions at particular points. In doing so, we’ll compare 
approximations obtained from Taylor polynomials of different degrees, and 
for clarity we need a notation that indicates the degree of each Taylor 
polynomial. The notation that we use is to denote a Taylor polynomial 

by pn, where n is its degree, rather than by just p. Thus, for example, the 
Taylor polynomials of degrees 1 and 2 about 0 for the function f(a) = e” 
are pi(z) = 1 +x and po(z) = 14+ 2+ 452", respectively. 


You’ve seen that usually the greater the degree of a Taylor polynomial p,, 
about a point a for a function f, the more accurate p,,(z) is as an 
approximation to f(x) for values of æ close to a. 


For example, consider the Taylor polynomials about 0 for the 
function f(x) = e”. You saw earlier that, for any positive integer n, the 
Taylor polynomial of degree n about 0 for this function is 
= l 2 1 3 1 n 

Prlz) =1 +s+ ta” Prie f 
Figure 21 shows the graphs of the Taylor polynomials of degrees 0, 1, 2 
and 3 about 0 for the function f(x) = e”, together with the graph of 
f(x) = e” itself. As you’d expect, it appears that as the degree of the 
Taylor polynomial increases, its graph approximates the graph of f near 0 
more and more closely. 
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it 
(c) po(w) =1l+at+ T 


Figure 21 Taylor polynomials about 0 for f(x) = e” 


Table 2 provides a numerical illustration for the same function, f(x) = e7, 
for a particular value of x near 0, namely z = 0.25. The value of e®?5 is 
1.284 025 4167, to ten decimal places. For values of n from 0 to 8, the table 
gives the Taylor polynomial p,,(x) about 0 for f, the value of this 
polynomial when x = 0.25, and the associated remainder. All the values 
are given to ten decimal places. You can see that as the degree n of the 
Taylor polynomial increases, the accuracy of p,(0.25) as an approximation 
for e°-2° improves. 


Table 2 Successive Taylor polynomial approximations for e°-?° 
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n Prlz) Dn(0.25) 9-25 — », (0.25) 
0 1 1 0.284 025 4167 
1 1l+2 1.25 0.034 025 4167 
2 1+g+g?/2! 1.281 25 0.002 775 4167 
3 lt+a+27/2!+ 23/3! 1.283 8541667 0.000171 2500 
4 1t+a+a7/2!+---+a*/4! 1.2840169271 0.000 008 4896 
5 ltat+a2?/2i+---+2°/5! 1.2840250651 0.000 000 3516 
6 ltat+a?/2!+---+2°/6! 1.284025 4042 0.000000 0125 
7 l+a+a27/2!+---+27/7! 1.2840254163 0.000000 0004 
8 l+ata?/2!+---+28/8! 1.284025 4167 0.000000 0000 


2 Taylor polynomials of any degree 


Table 2 shows that for f(x) = e” the Taylor polynomial of degree 8 

about 0 for f gives a method of calculating the value of e°-?> correct to 10 
decimal places by using only the standard arithmetical operations of 
addition, subtraction and multiplication. (Raising to a power is just 
repeated multiplication.) 


In general, Taylor polynomials can often be used to calculate 
approximations for values of functions to any desired accuracy. If f is a 
function and x is a particular value in the domain of f, then to find an 
approximation for f(a) we calculate a Taylor polynomial for f about some 
suitable point a close to x, and then evaluate it at x using only the 
standard arithmetical operations. 


Unfortunately, there’s no easy method for determining a suitable degree 
for the Taylor polynomial in any individual case. However, there’s a ‘rule 
of thumb’ that works in many cases, and in particular in most of the cases 
that you’re likely to come across. If you want an approximation accurate 
to m decimal places, then you calculate approximations using Taylor 
polynomials of degree 1, 2, 3, and so on, until two successive 
approximations agree to m + 2 decimal places. (You start with degree 1 
rather than 0 because constant Taylor polynomials rarely give useful 
approximations.) This method is illustrated in Example 8 below. 


Note that when we say that two numbers agree to a particular number of 
decimal places, we mean that the values resulting from rounding them to 
that number of decimal places are equal. Thus, for example, 0.237 and 
0.241 agree to two decimal places, since in each case rounding to two 
decimal places gives 0.24. However, 0.241 and 0.247 don’t agree to two 
decimal places, since rounding to two decimal places gives 0.24 in the first 
case and 0.25 in the second. 
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pa(1.1) = p3(1.1) — 4 (1.1 — 1)* = 0.095 308 
ps(1.1) = p4(1.1) + £ (1.1 — 1) = 0.095 310 
po(1.1) = ps (1.1) — % (1.1 — 1)® = 0.095 310. 


The values of p5(1.1) and pg(1.1) agree to six decimal places, so it is 
likely that 


In(1.1) = 0.0953 


to four decimal places. 


You can check using your calculator that it is indeed true that 
In(1.1) = 0.0953 to four decimal places, as obtained in Example 8. 


In Example 8, each successive approximation p,(1.1) was calculated by 
evaluating just the final term of pn(x) with x = 1.1, and then adding this 
value to pp—1(1.1), the previous approximation. This is an efficient way to 
proceed, but when working through a similar example yourself, you must 
make sure that each time you add an evaluated term to the previous 
approximation, you use the full-calculator-precision version of the previous 
approximation, rather than the rounded version that you just wrote down. 
Not doing so will cause errors in some cases. 


If you have a modern calculator, then you should be able to carry out this 
procedure without having to write down the unrounded values. Each time 
you want to add a new term, you can access the previous answer and 
calculate and add the new term, all in one step. After each such addition 
you can round off the approximation and write it down. 


Activity 13 Finding an approximate value of a function 


You saw on page 123 that, for each positive integer n, the Taylor 
polynomial of degree n about 0 for f(x) = e” is 


I 1 il 
palz] = Drato ep ae es 
Use these Taylor polynomials to find the likely value of e~°- to four 
decimal places. 


The next example is similar to Example 8 and Activity 13, but it involves 
Taylor polynomials about 0 for the sine function. Since this function is 
odd, its Taylor polynomials about 0 contain no even powers of x, as 
explained on page 126. Hence each Taylor polynomial of even degree is the 
same as the Taylor polynomial of degree one less; that is, po(x) = pı (x£), 
pa(x) = p3(x), and so on. You would therefore rapidly find two successive 
approximations that agree to any specified number of decimal places, but 
this would tell you nothing about the accuracy of the approximation! For 


2 Taylor polynomials of any degree 


this reason it makes sense to consider only the Taylor polynomials of odd 
degree for this function. 


Example 9 Finding an approximate value of an odd function A 


The Taylor polynomials of odd degree about 0 for f(x) = sina were — 
discussed on page 127. They are 


1 1 il 
pı(z)=z, ps(z)=z- a? ps(z) =x — a + w 
il iL 1 
pr(z) =a — Thal I ae = ae and so on. 


Use these Taylor polynomials to find the likely value of sin(0.2) to six 
decimal places. 


Solution 


@. Calculate p; (0.2), p3(0.2), ps (0.2), and so on. You want to find a 
pair of successive values that agree to 6 + 2 = 8 decimal places, so 
calculate values to 8 decimal places. Remember that each successive 
polynomial is obtained by adding a new term to the previous 
polynomial. © 


Calculating values to eight decimal places, we obtain 


pi(0.2) = 0.2 


p3(0.2) = pı (0.2) — - 


5 (0.2)? = 0.198 666 67 
i 
ps5 (0.2) = p3(0.2) + i (0.2)° = 0.198 669 33 


p7(0.2) = ps(0.2) — - 


The values of p5(0.2) and p7(0.2) agree to eight decimal places, so it is 
likely that 


sin(0.2) = 0.198 669 


(0.2)" = 0.198 669 33. 


to six decimal places. 


You can check using your calculator that it is indeed true that 
sin(0.2) = 0.198 669 to six decimal places, as obtained in the solution to 
Example 8. 
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Activity 14 Finding an approximate value of an even function 


The Taylor polynomials of even degree about 0 for f(x) = cosx were 
discussed on page 127. They are 


i 1 1 
po() — 1, p2(x) =1- T pa(x) i ae + ae 
1 1 1 
pelz) = 1 — 52? + —2* — — 2°, and so on. 
21 4! 6! 


Use these Taylor polynomials to find the likely value of cos(0.2) to six 
decimal places. 


In the final activity of this section you can use an applet to see the graphs 
of a variety of functions and their Taylor polynomials. 


3 Taylor series 


In Section 2 you saw that, usually, the greater the degree of a Taylor 
polynomial about a point a for a function f, the more accurate the Taylor 
polynomial is as an approximating polynomial for f close to a. But what 
happens if we take a Taylor polynomial of ‘infinite degree’; that is, if we 
add on all possible terms? We’ll look at that in this section. 


3.1 What is a Taylor series? 


In Activity 10 on page 124 you saw that the Taylor polynomial of degree n 
about 0 for the function f(x) = 1/(1 — x) is 


Pr(x)=1l+atartart-- +2". 


Let’s now consider what happens to this expression if we include all 
possible terms. The result is an infinite series, 


LEHL HH. (4) 


From what you saw in Section 2, you’d expect that for any value of x close 
to 0, as you add on more and more terms to the infinite series above, the 
resulting sums will approach the value of 1/(1 — x) more and more closely. 


As you saw in Unit 10, this is the same as saying that for any value of x 
close to 0, the infinite series has a sum, and the sum is given by 1/(1 — x). 


But how close to 0 does x have to be for this to happen? The answer is 
that x must be in the range —1 < x < 1. For any value of x in this range, 
the infinite series has sum 1/(1 — x). For any other value of x the series 
doesn’t have a sum at all. 


These facts follow from a result that you met in Unit 10. Consider the 
infinite geometric series with first term a and common ratio r: 


atar+ar?+are+---. 
You saw in Subsection 4.2 of Unit 10 that this series has 


sum —, if -l<r<l; 


no sum, ifr <-—lorr>l1. 


The infinite series (4) above is an infinite geometric series, with first term 1 
and common ratio x. Hence it has 


1 
sum ——, if —l<a<l; 
leg 


no sum, ifa<-lorge>1, 
as stated above. 
For example, if x = 0.5 then series (4) is 
1+ 0.5 + (0.5)? + (0.5)? +--- =14+0.5+0.25 40.125 +---, 


and it has sum 

1 = 

1-05 — 

In other words, for this infinite series, as more and more terms are added 
the resulting sum approaches 1/(1 — 0.5) more and more closely. 


2. 


By contrast, if x = 2 then the function f(x) = 1/(1 — x) has value 
f(2) =1/(1 — 2) = —1, but series (4) is 


1+2+2 OF 6 OF ST ed Ba IG ee 5 


and this infinite series has no sum. As more and more terms are added, 
the resulting partial sums get larger and larger, without approaching any 
particular value. 


In general, if f is a function that’s differentiable infinitely many times at a 
point a in its domain, then you can form an infinite series in which, for any 
integer n > 0, the first n + 1 terms form the Taylor polynomial of degree n 
about a for f. This series is called the Taylor series about a for f. 


For example, series (4) is the Taylor series about 0 for the function 
f(z) =1/(1 — z). 
You can obtain formulas for Taylor series from those for Taylor polynomials 


of degree n, by taking infinitely many terms. The following general 
formulas for Taylor series follow from formulas (2) and (3) on page 121. 


3 Taylor series 
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They're called Taylor series about a 
Taylor series Let f be a function that is differentiable infinitely many times at a 
because they're point a. The Taylor series about a for f is 
Taylor-made to fit! " (3) 
aroe e + EO ey? 
(n) 
pep ig apo (5) 


The point a is called the centre of the Taylor series. 
When a = 0, the Taylor series becomes 


£0) 2, £90) a LO) 


fO)+/Or+ a a 


This series is also known as the Maclaurin series for f. 


In this module, we usually refer to Taylor series about 0, rather than 
Maclaurin series. 


Notice that in the box above the general term of each series, 
involving (x — a)” (or x”, in the case a = 0), has been written down 
explicitly as part of the series. This helps to clarify the general pattern. 


Colin Maclaurin was born in Argyllshire, studied at the University of 
Glasgow and became professor of mathematics first at Marischal 
College, Aberdeen and then at the University of Edinburgh. In 1742 
he published the two-volume Treatise of Fluaions, which was the first 
systematic exposition of Newton’s methods in calculus. He wrote it as 
a reply to attacks made on calculus for its lack of rigorous 
foundations. In this treatise Maclaurin uses Taylor series about 0. 
Although he acknowledged Taylor, Maclaurin’s name is now often 
used to describe these series. 


kaneya 


Colin Maclaurin (1698-1746) 


® Example 10 Finding a Taylor series about 0 
— Find the Taylor series about 0 for the function f(x) = e”. 
Solution 


@. Repeatedly differentiate f to find f’, f”, f®, ..., and find the 
values of f(0), f’(0), f”(0), f® (0), .... Then apply formula (6). @ 


Here f(0) = e? = 1. Also, the nth derivative of the function f(x) = e” 
is f(x) = e”, so f™(0) = 1 for all positive integers n. 
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Hence, by the formula for a Taylor series about 0, the required Taylor 
series is 


l arl grla 
Dee ia o oee o 


In the next activity you’re asked to find the Taylor series about 0 for the 
cosine and sine functions. 


Activity 15 Finding Taylor series 


Find the Taylor series about 0 for each of the following functions, writing 
down enough terms to make the general pattern clear. 


(a) f(x) = cos z (b) f(x)= sing 


In each case you should be able to see a pattern in the values f(0), f’(0), 
f""(0), f (0), ..., from your working for Activity 9. 


It follows from what you saw earlier about Taylor polynomials for even and 
odd functions that the Taylor series about 0 for an even function contains 
terms in even powers of x only, and the Taylor series about 0 for an odd 
function contains terms in odd powers of x only. The Taylor series in 
Activity 15 are examples of this fact. Remember that the cosine function 
is an even function and the sine function is an odd function. 


All the Taylor series that you’ve seen so far in this section have had 
centre 0. In the next activity you’re asked to find a Taylor series with a 
different centre. So here you need to use formula (5), for a Taylor series 
about a point a, rather than formula (6), for a Taylor series about 0. 


Activity 160 Finding a Taylor series about a point other than 0 


Find the Taylor series about 7/2 for the function f(x) = sin x, writing 
down enough terms to make the general pattern clear. 


Validity of Taylor series 


You’ve seen that the Taylor series about a point a for a function f usually 
has sum f(x) for values of x close to a, but may not have sum f(x) for 
other values of x. If x is a point for which the Taylor series about a for f 
has sum f(x), then we say that the Taylor series is valid at the point zx. 


For example, you saw at the beginning of this subsection that the Taylor 
series about 0 for the function f(x) = 1/(1 — x) is valid for all x in the 
interval —1 < x < 1, but isn’t valid for values of x outside this interval. 
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You may be surprised to learn that the Taylor series about 0 for the 
exponential, sine and cosine functions (found in Example 10 and 
Activity 15) are all valid for every real number x. In other words, the 
equations 


1 1 1 
zT — 2 3 4 ipi 
e =ltat+ gT Tae tg” + 3 
: = 1 3 1 5 1 7 1 9 
sın Tt = r 3 a 7 ai 5 
= 1 2 1 4 1 6 1 8 
osgp=1- T va él” al” pinsin 


are true for all x € R. 


When you remember that the coefficients of a Taylor series for a function 
are chosen by taking into account the value of the function and its 
derivatives at a single point a, it may seem amazing that the resulting 
series can turn out to be equal to the function for every real number z! 


A Taylor series about a point a is always valid for x = a. This is because if 
we set x = a in formula (5) on page 138, then all the terms except the first 
are equal to zero, so the sum of the series is just the first term f(a), which 
is precisely the value of f at a. This is by design, since the first term of a 
Taylor polynomial is chosen to be f(a) to ensure that the value of the 
polynomial at a is the same as the value of f at a. 


Often, though not always, the largest set of points for which a Taylor series 
about a point a for a function is valid is either the whole set of real 
numbers R, or an interval with two endpoints whose midpoint is a. Each 
endpoint may or may not be included in the interval. 


Any interval of values of x for which a Taylor series is valid is called an 
interval of validity for the series, and the series is said to represent the 
function on any interval of validity. For example, —1 < x < 1 is an interval 
of validity for the Taylor series about 0 for the function f(x) = 1/(1 — 2). 
Any interval that is contained within the interval —1 < x < 1, such as 

-4 LEZ $, is also an interval of validity for this series, but —1 <a < 1 is 
the largest such interval. This interval could also be denoted by (—1, 1), 
using the usual notation for an open interval, but the inequality notation is 
more usual in the context of Taylor series, and it will be used throughout 
this unit. 


You might wonder how it can be determined that R is an interval of 
validity for the Taylor series about 0 for the exponential, sine and cosine 
functions, and more generally, how the largest interval of validity for a 
Taylor series can be found. There’s a method for doing this, as follows. 
You first find a formula for the remainder r(x) = f(x) — pn(x), which is 
the difference between the value of the function f at x and that of the 
Taylor polynomial of degree n about a for f. Then you have to decide for 
which values of x this remainder r(x) tends to 0 as n tends to oo. The 
techniques that you need to do this are taught in more advanced modules 
on pure mathematics. You won’t be expected to find intervals of validity in 
this module, except in Section 4 where you’ll be working from known 
results. 


Using sigma notation for Taylor series 


Formula (5) on page 138 for the Taylor series about a for a function f can 
be written concisely in sigma notation as 


You can often write Taylor series for particular functions concisely in 
sigma notation in a similar way. For example, the Taylor series about 0 for 
the function f(x) = e”, given in Example 10, is 


1+24 ae jot ete. = Sa", 
In the cases of odd and even functions, it’s more awkward to write down 
the Taylor series about 0 in sigma notation, because for even functions 
such series contain only even powers of x, and for odd functions they 
contain only odd powers of x. The Taylor series for the function 

f(x) = sina, given on page 140, is usually written as 


1 gs, ts 117z, 1.6 OS (D ony 

x 31° a 77 tga? = Loreal : 
Here, for n = 0,1,2,..., the expression x?"+1 equals x, zì, 2°, ... , while 
the expression (2n + 1)! takes the values 1, 3!, 5!,... , and the expression 


(—1)” deals with the alternating signs. 


The Taylor series for the function f(x) = cos x, given on page 140, is 
usually written as 


re ee ee ee ee 8 =x 1)" Gs, 
n=0 
Here, for n = 0,1,2,..., the expression x?” equals 1, x”, xt, ... , while the 
expression (27)! takes the values 0! = 1, 2!, 4!, ... , and again the 


expression (—1)" deals with the alternating signs. 


Taylor series about 0 for other odd and even functions can be written in 
sigma notation in a similar way. 


3.2 Some standard Taylor series 


In Subsection 3.1 we obtained the Taylor series about 0 for the exponential, 
sine and cosine functions, and for the function f(x) = 1/(1 — x). For ease 
of reference, these series are stated in the following box, along with the 
Taylor series about 0 for two other standard functions. These Taylor series 
can be obtained by using the second formula in the box on page 138. The 
following box also gives intervals of validity for the Taylor series. 
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Shall we recora 
the whole series? 


Standard Taylor series about 0 
sal 


GBs je a Page eR RM ey til, eg acai ae 
sing =x ay @ taf ae tot , forxER 
= 1 2 a l orl 
2 Ta cg a P for x E R 
E sa cee ee for x E€ R 
E 2! 3! 4! ; 
In(1 +g) =x įr? + ir? jal teo, for —l <a < 1 
1 
ee e for -l<a<l 
-T 
— 1 —1 —2 
Otol Sten SS a a 


for —1 < x < 1 (where a is any real number) 


The last series here is called the binomial series. You’ll see shortly how 
it’s linked to the binomial theorem, which you met in Unit 10. 


The binomial series was discovered by Isaac Newton. 


Notice that each of the intervals of validity given in the box is an open 
interval whose midpoint is the centre 0 of the series. These intervals of 
validity are the largest intervals for which the series are valid, with two 
exceptions. The series for In(1 + x) is also valid when x = 1, but the box 
gives the interval —1 < x < 1 because it’s often convenient to work with 
open intervals. For example, this is the case when you're differentiating 
and integrating Taylor series, as you'll see later in the unit. 


The other exception involves the binomial series. For —1 < x < 1, this 
series sums to (1 + x)” for any real number a, including negative and 
fractional numbers. For most values of a, the largest interval of validity is 
—1 < x < 1, but when a is a non-negative integer, the series is valid for 
every real number x. You’ll see after the next example why this is so. 


The box gives a Taylor series about 0 for the function In(1 + x), rather 
than for the standard function ln x. This is because the function Ina has 
no Taylor series about 0, since its domain (0,00) does not contain 0. 


Note that it’s often convenient to say ‘the function ln(1 + x)’, as in the 
paragraph above, as a shorthand for a more precise statement such as ‘the 
function f(x) = ln(1 + 2)’. 


Example 11 Using the binomial series 


Use the binomial series to find the Taylor series about 0 for each of 
the following functions. In each case state an interval of validity for 
the series. 


(a) f(x) = (b) f(z) =(1+2)* 


Solution 


(a) Since 1/(1 +2) = (1 + x)`!, we take a = —1 in the binomial 
series, to give 

1 (A 23 

ae ee eee epee 


=1l-a2+a@?-a4+a4-.... 


@. When writing down an infinite series, remember to include 
either +--- or —--- at the end, to indicate that there are further 
terms. # 


This Taylor series is valid for —1 <a < 1. 
(b) Taking a = 4 in the binomial series gives 
4x3 5, 4x3x2 3 
ees eee 
4x3x2x1l,. 4x3x2x1x0, 
m 5l í 
4x3x2x1x0x(-1) e 


= 1 + 4r + 627 + 4r? + ot. 


(ene = l o 


This Taylor series is valid for —1 < x < 1. 


®, In fact this Taylor series is valid for z € R, which would be an 
equally appropriate answer. (See the discussion below.) ® 


Example 11(b) illustrates that it’s possible for a Taylor series to have a 
finite number of terms. This occurs when the coefficients of all terms from 
some term onwards are zero. The series in this example may also look 
familiar to you: it is the binomial expansion of (1 + 2)*. 


If a is any positive integer, then all terms after the term in x° in the 
binomial Taylor series for (1 + x)” contain the factor a — a = 0, and are 
therefore equal to 0. The series is then the same as the binomial expansion 
of (1+ x)°, which is valid for all x € R. The binomial series therefore 
generalises the binomial expansion of (1 + x)” from cases where a is a 
positive integer to cases where a can be any real number. 
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Activity 17 Using the binomial series 


Use the binomial series to find the Taylor series about 0 for the function 
f(x) =1/(1+ 2)?. (Write down enough terms to make the general pattern 
clear.) State an interval of validity for the series. 


Activity 17 involved using the binomial series when the power of 1 + z is a 
negative integer. The next example illustrates using the binomial series 
when the power of 1 + z is a fraction. 


Example 12 Finding a binomial series for a fractional power of 1 + x 


Use the binomial series to find the Taylor series about 0 for the 
function f(a) = (1+2)~!/?. (Write down enough terms to make the 
general pattern clear.) State an interval of validity for the series. 


Solution 
Taking a = -4 in the binomial series gives 
-i3 -I Sye 
ata) a1 4 (fet COED 2, CDCDED 2 
1 3 5 7 
E 
4! 
13 IL S< ahsx 
=a pene Oa 3 
See oe oe oe. 
IBD an 


eal 
This Taylor series is valid for —1 <a < 1. 


In Example 12, the coefficients in the Taylor series have been left in the 
form shown, rather than completely evaluated, so that the pattern involved 
is clear. It’s usually a good idea to do this, but sometimes you may be 
asked to evaluate the first few coefficients of a Taylor series explicitly, in 
which case you should write each coefficient as a single integer or fraction. 


Activity 18 Finding a binomial series for a fractional power of 1 + x 


Use the binomial series to find the Taylor series about 0 for the function 
f(x) = (1 + 2)!/2. (Write down enough terms to make the general pattern 
clear.) State an interval of validity for the series. 


Once you know the Taylor series for a function f about a point a, you can 
find the corresponding Taylor polynomial of any degree n by truncating 
the series at the appropriate term. (To truncate a series at a term is to 
delete all subsequent terms.) 


Activity 19 Using a Taylor series to find a Taylor polynomial 


You saw earlier that the Taylor series about 0 for the function 
f(z) = ln(1 +2) is 


In(l +2) = gx- $a? + ir? — iatt gas, for -l<2<1l. 


Using this series, write down the cubic Taylor polynomial about 0 for this 
function f. 


The Taylor polynomials obtained by truncating a Taylor series for a 
function f can, in principle, be used to find approximations for f(x) for all 
values of x for which the series is valid. For example, you’ve seen that the 
Taylor series about 0 for the function f(x) = ln(1 + 2) is valid for all 
values of x in the interval —1 < x < 1. This means that, in principle, you 
can use Taylor polynomials about 0 to find an approximation for In(1 + x) 
for any value of x in this interval. However, the further x is from the 
centre 0 of the Taylor series, the greater is the degree of the Taylor 
polynomial that you need to provide the desired level of accuracy. 


For instance, you can find an approximation for In(1.1) by putting 

x = 0.1 in a Taylor polynomial about 0 for In(1 + x), and you can find 

an approximation for In(1.5) by putting x = 0.5 in the same Taylor 
polynomial. However, to find an approximation for In(1.1) correct to 

four decimal places, by using the method of Subsection 2.2 and obtaining 
the required Taylor polynomials by truncating the Taylor series at the 
appropriate terms, you have to evaluate six successive Taylor polynomials, 
whereas to find an approximation for In(1.5) to the same level of accuracy 
you have to evaluate 17 successive Taylor polynomials. For In(1.9) you 
need 92 successive Taylor polynomials! 


In the next activity you’re asked to use the method of Subsection 2.2 to 
find an approximation for a particular value f(x) of a function f, 
obtaining the required Taylor polynomials by truncating the Taylor series 
for f about a point a close to x. 
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Activity 20 Finding an approximate value for a function 


You saw in Activity 18 that the Taylor series about 0 for the function 
f(x) =(1+2)"? is 
1 1x3 1x3x5 
1/2 Wey ce E: 3 4). 
rey = tt al geo “ose” aaa © 
and that it is valid for —1 < x < 1. By writing 1.1 as 1 + 0.1, use this 
series to find the value of V1.1 to three decimal places. 


(Notice that x = 0.1 lies within the interval of validity —1 < x < 1 for the 
Taylor series.) 


3.3 Using a computer to find Taylor polynomials 


In the following activity you'll learn how to use the module computer 
algebra system to find Taylor polynomials. 


Activity 21 Taylor polynomials on a computer 


Work through Section 12 of the Computer algebra guide. 


4 Manipulating Taylor series 


In this final section, you’ll see some methods that allow you to obtain 
Taylor series for many functions from a few known Taylor series such as 
the standard ones in the box on page 142. This usually involves much less 
work than obtaining the required Taylor series by using one of the general 
formulas in the box on page 138. 


When finding a Taylor series for a function, you can use any of the 
standard Taylor series. You’re not expected to derive any of the standard 
series unless explicitly asked to do so. 


4.1 Substituting for the variable in a Taylor series 


You’ve seen that the Taylor series about 0 for the function 
g(x) = 1/(1 — x) is given by 
1 


fa ee ee en for -l<a<l. 
-T 


4 Manipulating Taylor series 


Consider the effect of substituting x = 2t in this equation. This gives 


pS BE OE) (27)? 4 eos 
joy t (2t) + (2t)° + 


=1 +20 +4 +8 HH. 


We have obtained a series equal to 1/(1 — 2t). Since the Taylor series for 
g(x) = 1/(1 — x) is valid for —1 < x < 1, the series in t above is equal to 
1/(1 — 2t) for —1 < 2t < 1, that is, for —$ < t < ż. 


Let’s now replace t by x, since it’s more usual to use x rather than t for 
the variable. This gives 
1 

1 — 2g 
The series in equation (7) is equal to the function f(x) = 1/(1 — 2x) for 
4 <au< $, but is it a Taylor series? It’s of the right form to be a Taylor 
series about 0, since each of its terms is a power of x multiplied by a 
constant. However, if we were to use formula (6) on page 138 to find the 
Taylor series about 0 for the function f, would we obtain the same series? 
The answer to this question is yes. This follows from the following fact, 
whose proof is beyond the scope of this module. 


= 1 +2 +42? +82? +, for-3<a<}. (7) 


Uniqueness of Taylor series 
Let f be a function. If you can by any means find a series 
co + e1(a — a) + co(x — a)? teor — a)? +--- 


that is equal to f(x) for all x in some open interval containing a, then 
this series is the Taylor series about a for f, and hence it is the only 
series of this form that is equal to f(x) for all x in that interval. 


You can assume this important fact throughout the rest of this unit. 


You can find Taylor series for many functions by substituting for the 
variable in a Taylor series that you already know, and you can often 
deduce an interval of validity for the new series from an interval of validity 
for the original Taylor series. When substituting for the variable, it’s 
quicker to avoid introducing a new variable t as was done above, and 
instead replace x in the original Taylor series by an expression involving x, 
as illustrated in the next example. 


Example 13 Substituting into a Taylor series CS) 


Find the Taylor series about 0 for the function 


il 
DOO emer 


and determine an interval of validity for this series. 
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Solution 


@. The expression 1/(1 + x?) is similar to 1/(1 — x), for which there’s 
a standard Taylor series (given on page 142). The first expression is 
obtained from the second expression by replacing x by —x?. ® 
The Taylor series about 0 for 1/(1 — x) is 
1 ns 
e ee ae tare+---, for -l<a<l. 
-zr 


Replacing each occurrence of x by —x? gives 


ay = la (0?) + (0? + (a?) (8) 
1 — (=z?) 

that is, 
1+2? 


This is the Taylor series about 0 for 1/(1 + 27). 


@. Deduce an interval of validity for the new series from the interval 
of validity for the original series. ® 


The Taylor series for 1/(1 — x) is valid for —1 < x < 1, so the series 
for 1/(1 — (—2?)) is valid for —1 < —2? < 1. 

@. The double inequality —1 < —a? < 1 is equivalent to the two 
single inequalities —1 < —x? and —x? < 1. ® 


The left-hand inequality is —1 < —a?, which is equivalent to 1 > z?; 
that is, -l<a<l. 

The right-hand inequality is —x? < 1, which is equivalent to x? > —1 
and therefore does not place any restriction on x, since the square of 
any real number is non-negative. 


Thus the Taylor series about 0 for 1/(1 + x?) is valid for -1 < x <1. 


In Example 13, the variable x was replaced by —x?. This is equivalent to 
making the substitution z = —t? and then replacing t by z. 


As illustrated in Example 13, when you deduce an interval of validity for a 
Taylor series from an interval of validity for another Taylor series, you 
usually have to rearrange inequalities. Rules for rearranging inequalities 
were given in Subsection 5.2 of Unit 3. 


When you have to rearrange a double inequality like —1 < —x? < 1, it’s 
often helpful to split it into two single inequalities and rearrange each 
independently, as was done in Example 13. With simple double 
inequalities, such as —1 < 2% < 1, you may be able to rearrange both 
single inequalities together; for example, in this case we simply multiply 
both inequalities by i to obtain 5 LIZ 5: 


When you replace each occurrence of x in a Taylor series by an expression 
involving x, it’s helpful to enclose the whole expression in brackets at each 
replacement, and then simplify the resulting terms, as illustrated in 
Example 13. Make sure that you enclose the whole expression in brackets. 
For instance, in equation (8) the third term is (—x?)? = z4, not 

—(x7)? = —24. 


Activity 22 Substituting into a Taylor series 


By substituting for the variable in a standard Taylor series, find the Taylor 
series about 0 for each of the following functions. In each case determine 
an interval of validity for the series. 


(a) f(z) =1/(1+22) (b) f(æ@)=m(1-s) (c) f(x) =n(1 4:32) 
(d) f(x) =e" 


You’ve seen that substituting for the variable in a Taylor series gives a 
Taylor series for another function. In each case so far, the new series has 
had the same centre as the original series. However, some substitutions 
lead to a new Taylor series with a different centre. For example, suppose 
that the Taylor series about 0 for a function g is 


g(x) = co tere + con? + cgr? +- 


If you replace each occurrence of x by x — a, then you obtain 


g(a —a) = co + c (£ — a) + co(x — a)? + c3(£ — a)? +- 


This is the Taylor series about a for the function f given by 
f(x) = g(x — a). You're asked to use this fact in the following activity. 


Activity 23 Changing the centre of a Taylor series 


In Example 11(a), the binomial series was used to show that the Taylor 
series about 0 for 1/(1 + x) is 
: =l-g4e¢-7 +s, for —1<% <1. 
1+2 
By replacing x by x — 1 in this series, find the Taylor series about 1 for the 
function f(x) = 1/2. Determine an interval of validity for this series. 


Remember that you can be sure that the series found in the solution to 
Activity 23 is indeed the Taylor series about 1 for 1/x, because of the fact 
about the uniqueness of Taylor series in the box on page 147. 


Sometimes you can find a Taylor series for a particular function by 
rewriting its rule to make it more similar to a function whose Taylor series 
you already know, and then replacing the variable x by a suitable 
expression in x. This is demonstrated in the following activity. 
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Activity 24 Rearranging in order to find a Taylor series 


(a) Find the Taylor series about 0 for the function 
3 


MOSTE 
by writing 
3 1 
f(a) = 


3+2 1+ 2a 
Determine an interval of validity for this series. 


(b) Find the Taylor series about —1 for the function 


Ie) = sae 
by writing 
1 1 
f(z) 


~ 3422 14+2(¢+1) 
Determine an interval of validity for this series. 


4.2 Adding, subtracting and multiplying Taylor 
series 


Another way to find Taylor series for some functions is to apply standard 
arithmetical operations to known Taylor series, doing this term by term, as 
illustrated in the following example. 


You can add or subtract any two Taylor series with the same centre term 
by term in the way demonstrated in Example 14. The resulting Taylor 
series is valid for all values of x for which both original Taylor series are 
valid, and possibly for a larger interval of values. 


Activity 25 Subtracting Taylor series 


The following standard Taylor series about 0 was stated earlier (page 142): 
In(1 +g) =x- 32° +423 —-fat+--. > fo =I <e <i 
In the solution to Activity 22(b) this series is used to deduce the following 
Taylor series about 0: 
In(1 — 2) = —a — 42” — 42? — fat , fr —-1<z<1. 


By using the fact that 


In (; =) = In(1 + z) — ln(1 — 2), 


use the two series above to find the Taylor series about 0 for 


In l+z2z 
l-a/)- 


Determine an interval of validity for this series. 
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The result from Activity 25 turns out to be useful for finding an 
approximation for the natural logarithm of any positive number. Any 
positive number t can be expressed in the form t = (1 + £)/(1 — x) for some 
number x in the interval —1 < x < 1. For example, 3 = (1 + 4)/(1 — 4). 
You can see that this is possible for every positive number t by rearranging 
the equation above to make x the subject: 


1+2 
t= 1 
ing FU 
(l-az)t=l+z 
t-at=1l+2 
t-l=at+ca 
t—-l=2(t+1) 
t—1 
r = —. 
t+1 


This final equation gives the value of x corresponding to a positive 
number t. For example, if t = 3, then x = (3 — 1)/(3+ 1) = 4. 


To see that the value of x always turns out to lie in the interval 
—1 < x < 1, note that the expression on the right-hand side of the final 
equation above can be rearranged as follows: 

„t-i _ (2 _, 2 

ee 2 O ttl 
Since t is positive, the value of 2/(t + 1) is positive, and hence x < 1. Also, 
again since t is positive, the value of 2/(t + 1) is less than 2, and hence 
GS Hl. 


Thus in principle you can use the Taylor series found in Activity 25 to find 
an approximation for Int for any number t in the domain (0, co) of the 
function In. 


In contrast, you can use the series for In(1 + x) to find an approximation 
for Int only for 0 < t < 2, since these are the only values of t that can be 
expressed in the form t = 1 + 2 for some z in the interval —1 < x < 1 (see 
page 145). 


For both series, the further x is from 0, the more terms of the series you 
have to evaluate to obtain the desired level of accuracy. However, to find 
the value of Int for a number t for which you could use either series, you 
usually have to evaluate fewer terms of the series found in Activity 25 than 
of the series for In(1 + x) to obtain the desired level of accuracy. 


Activity 26 Finding the value of a particular logarithm 


Find the value of x = (t — 1)/(t + 1) that corresponds to t = 1.5. Hence 
use the Taylor series found in Activity 25, namely 


1 
m (1) = 2g + fr? + bað + 2r to, 


to find the value of In(1.5) to four decimal places. 


In Activity 26, it was necessary to evaluate five successive Taylor 
polynomials in order to find the value of In(1.5) correct to four decimal 
places. If you used the series for In(1 + x) for this task, with x = 0.5, then 
you would need to evaluate 17 successive Taylor polynomials, as mentioned 
earlier (page 145). This illustrates the comment above that you need to 
use fewer terms of the series found in Activity 25. 


You’ve seen that you can add and subtract Taylor series. You can also 
multiply a Taylor series term by term by a non-zero constant. The 
resulting series is valid for every value of x for which the original Taylor 
series is valid. 


For example, you can multiply the Taylor series 
1 1 1 
r E AS: E E 

e =i ras * + ae + ae t , forxzeER, 

by 3, to deduce that 
3 3 3 
r 2 S m3 k oe eee 

3e =3+30+ 5,2 + 3 tae t , forxeR. 
In the next activity you’re asked to use the technique of multiplying a 
Taylor series by a constant, together with substitution, to find the Taylor 


series about 0 for the function 1/(3 + x)?. Here it’s useful to use the 


rearrangement 
1 O 1 O 1 


Activity 27 Finding a Taylor series for 1/(3 + x)? 


You were asked to show in Activity 17 that the Taylor series about 0 for 
the function 1/(1 + x)? is 


1 
ja 1 Oe +3r? ae +., for — lag <i. 
(1+ zx)? 
Use this Taylor series, together with the fact that 
il 1 1 


= = xXx ——_,,, 
(3+2)? 32 (1+ ir)? 


to find the Taylor series about 0 for 1/(3 + x)?, and determine an interval 
of validity for this series. 


You can find the Taylor series about 0 for any function of the form 

(c+ x)“, where c is any positive constant and a is any real number, by 
deducing it from the series for (1 + x)“ using the method of Activity 27. 
That is, you first express (c + x) as c%(1 + 2/c)®. 
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The next example and activity involve using the techniques that you’ve 
met in this section to find the Taylor series about 0 for two further 
standard mathematical functions. The function sinh (usually pronounced 
as ‘shine’ or ‘sine-sh’) is the hyperbolic sine function, and is given by 


sinh z = 4(e” — e7”). 


The function cosh (pronounced just as ‘cosh’) is the hyperbolic cosine 
function, and is given by 


cosha = $ (e” + e7”). 


The graphs of these two functions are shown in Figure 22. 


Figure 22 The graphs of the functions sinh and cosh 


Although you might not expect it from their definitions or their graphs, 
the hyperbolic functions sinh and cosh have many properties analogous to 
those of the trigonometric functions sin and cos. For example, cosh, like 
cos, is an even function and sinh, like sin, is an odd function. Also, the 
derivative of sinh is cosh and 


sinh(x + y) = sinh z cosh y + cosh z sinh y. 


The two properties immediately above are exact analogues of properties of 
trigonometric functions. Some properties of sinh and cosh are similar to, 
but not exactly analogous to, those of sin and cos. For example, for sin 
and cos we have the trigonometric identity sin? x + cos? x = 1, whereas for 
sinh and cosh we have the identity cosh? x — sinh? x = 1. Also, the 
derivative of cos is — sin, whereas the derivative of cosh is sinh. 


Yov’ll learn more about the functions sinh and cosh if you study the 
module MST125 Essential mathematics 2. 


You can find the Taylor series about 0 for the functions sinh and cosh by 
using their definitions, together with the standard Taylor series about 0 for 
the exponential function. You have to use the techniques of adding and 
multiplying by a constant, and the technique of substitution. The series for 
cosh is found in the next example, and you’re asked to find the series for 
sinh in the activity that follows. 


Example 15 Finding the Taylor series about 0 for cosh x 


Find the Taylor series about 0 for the function f(x) = cosh and 
determine an interval of validity for this series. 


Solution 
We use the formula 
cosha = $ (e” + e`”). 


The Taylor series about 0 for e” is 


1 1 1 
rc 2 3 1a eee 
ae + ae ae 7 , forxeER. 
Replacing each occurrence of x by —2 gives 
i 1 
Sie i) G2 Sa One E 
Cue — all t+ 52 a? + ae , forzEeR 
Therefore 
ge ee IG ee eae 
2 72 2! 3! 4! 
T E ae a : for x € R; 
2! 3! 4! j i 
that is, ; i 
Oe Nae a for x € R. 


Activity 28 Finding the Taylor series about 0 for sinh x 


Find the Taylor series about 0 for the function f(x) = sinh z, and 
determine an interval of validity for this series. 


You can see from the solutions to Example 15 and Activity 28 that the 
Taylor series about 0 for cosh has only even powers of x, while that for 
sinh has only odd powers of x. This is because cosh is an even function, 
while sinh is a odd function. 


Notice that the Taylor series about 0 for sinh and cosh are similar to those 
for sin and cos, respectively. The only difference is that all the coefficients 
in the series for cosh x and sinh are positive, whereas the coefficients in 
the series for cos x and sin z alternate in sign. 


Multiplying Taylor series together 


You’ve seen that Taylor series can be added, subtracted and multiplied by 
a non-zero constant. You can also multiply together two Taylor series with 
the same centre. The resulting Taylor series is valid for all values of x for 
which both original Taylor series are valid, and possibly for a larger 
interval of values. 
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The next example illustrates the multiplication of two Taylor series. It 
involves finding the Taylor series about 0 of the product of a polynomial 
and another function. Note that it’s easy to write down the Taylor series 
about 0 of a polynomial, 


F(x) = co + CIT + C£? +- H Cna”, 


because this is already in the form of the series in the box on page 147 
(with a = 0, and the coefficients of all terms from z”! onwards equal 
to 0). It follows that the Taylor series about 0 of a polynomial is the 
polynomial itself! It’s valid for all x € R. 


Example 16 Multiplying Taylor series 


Find the Taylor series about 0 for the function 
=e 


and determine an interval of validity for this series. 


Solution 


The Taylor series for 1 — x is 1 — x, and the Taylor series for 
1/(1+ z), found in Example 11(a), is 


il 
E =le oo ea for — a 1. 
at 
Therefore 
= 
MEato) 


= I(l = pa" =i ky a dan =a doao) 
= (l-r a e aa +29 —-.--) 
= a a doana y 


The Taylor series about 0 for 1 — x and for 1/(1 + x) are valid for 
x € R and for —1 < x < 1, respectively. Hence the Taylor series for 
(1 —2)/(1+ 2) is valid for —1 < x < 1. 


In the next activity you’re asked to carry out two multiplications of Taylor 
series. In each case one of the two series is a polynomial. 
Activity 29 Multiplying Taylor series 


Find the Taylor series about 0 for each of the following functions. In each 
case determine an interval of validity for the series. 


(a) f(x) = z? sinz (b) f(x) =(1+2)cosz 


Earlier you saw that you can multiply a Taylor series by a non-zero 
constant. This is just a special case of the multiplication of two Taylor 
series. For example, you can obtain the Taylor series about 0 for 3e” by 
multiplying together the Taylor series about 0 for the constant function 
f(x) =3 and the Taylor series about 0 for e”; the Taylor series about 0 for 
the function f(x) = 3 is simply 3, since 3 is a polynomial. 


In each case where we’ve multiplied together two Taylor series, one of the 
series had only finitely many non-zero terms; that is, it was a polynomial. 
Multiplying together two Taylor series both of which have infinitely many 
non-zero terms is usually a difficult task, and you won’t be asked to carry 
out any complete multiplications of this type in this module. However, it’s 
fairly straightforward to multiply together the first few terms of two 
infinite Taylor series to find the first few terms of the product Taylor series; 
that is, to find a Taylor polynomial. This is illustrated in the next example. 


Example 17 Finding a Taylor polynomial by multiplication 

Find the cubic Taylor polynomial about 0 for the function 

I= e cosa. 

Solution 

@. Multiply the series in the usual way. At each stage, include 
explicitly only those terms that could eventually result in final terms 
with power 3 or less. Ignore any terms that can lead only to final 


terms with power 4 or more, but use the notation ‘---’ to indicate 
that further terms exist. © 


Using the Taylor series about 0 for e” and for cosx, we obtain 


i r2 r3 i r2 
ees ey ere =a 


e” cosx 


Il 

— 
aS 
pa 

| 

|S 
>| n 
+ 
ay 
+ 

8 
a 
E 

| 

A 
+ 
ooi 


= (1— z? +- -) + (z — iz? +) + (ir?) 
+(e) 
=1+a+(-$4+ t)r’? + (4 i)e t 


Therefore the cubic Taylor polynomial about 0 for f(x) = e” cos x is 


o — 32°. 


You can use a similar method in the next activity. 
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Activity 30 Finding a Taylor polynomial by multiplication 


In Example 11(a), the binomial series was used to show that the Taylor 
series about 0 for 1/(1 + x) is 


1 
1+ 


Use this Taylor series and the Taylor series for sin x to find the cubic 
Taylor polynomial about 0 for the function 


=1-zr+r? -r t+, for -l<2¢<1. 


ya) = Èz, 


4.3 Differentiating and integrating Taylor series 


You’ve seen that the Taylor series about 0 for the function f(x) = sin z is 


Let’s consider the effect of differentiating this series term by term, in the 
way that we would if it had only finitely many terms and so was a 
polynomial. We obtain the series 


You may recognise this series as the Taylor series about 0 for cosx. So by 
differentiating term by term the Taylor series about 0 for f(a) = sin x, we 
obtained the Taylor series about 0 for its derivative, f'(x) = cos. 


This observation suggests that term-by-term differentiation of a Taylor 
series about 0 for a function f gives the Taylor series about 0 for its 
derivative, f’. This conjecture can be verified as follows. 


Let f be a function that’s differentiable infinitely many times at 0, and let 
g = f'. The Taylor series about 0 for f is 


Uh (3) (n) 
Miror £0 2, O94... 4 Om 


Notice here that the general term, in z”, has been written down explicitly 
as part of the series. 


+ 9) 


Differentiating series (9) term by term (keeping in mind that f(0), f’(0), 


f” (0), ... are constants) gives the series 
n (3) (n) 
; n f® 0) JO. a 
=f OEF Oet EE CR 14 


Since g = f’, we have 9(0) = f/(0), g'(0) = f"(0), g”(0) = f(0), and so 
on. Therefore we can write the series above as 

1 n—1) 
g'(0) 2 g?-9 (0) na 
a aa 
This is the Taylor series about 0 for g = f’. (The general term is expressed 
in terms of n — 1 instead of n.) 


g(0) + g'(O)a + 


Taylor series can also be integrated term by term. If the Taylor series 
about 0 for a function f is integrated term by term, then the result is the 
Taylor series about 0 of an antiderivative of f. 


These properties of Taylor series are summarised in the following box. 
Here co is written for f(0), cı for f’(0), c2 for f”(0)/2!, and so on, to 
simplify the notation. 


Differentiating and integrating Taylor series about 0 


Let f be a function that is differentiable infinitely many times at 0. 
If the Taylor series about 0 for f is 


Qj pew tor Par fo teg a, 
then the Taylor series for f’ is 
ei + 2cox + ear + 4ega® +--+ + negc™1+4+---, 


and the Taylor series for any antiderivative of f is of the form 
Cl 9 , 2s Co ntti 
—7 — T aa, —~—_ T eee 
CO aaa Tg ale a a, 
where c is a constant. 
Any interval of validity for the Taylor series for f that is an open 


interval is also an interval of validity for the Taylor series for f’ and 
for any antiderivative of f. 


The results in the box above can be extended to Taylor series with centres 
other than 0, but we won’t need to use such series in this module. 
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Example 18 Differentiating and integrating a Taylor series 

In Example 11(a), the binomial series was used to show that 
il 

Il sb ae 

Also, by either the chain rule or the quotient rule, 


Sl Sa Be lot Sl ae = IL. 


dy ae 1 

dx¢\il+e/ (14+2)?’ 
and by a standard integral and the rule for integrating a function of a 
linear expression, 


il 
= S 
[e In(l+2)+e (#>-1), 


where c is a constant. Use these facts to find the Taylor series about 0 
for each of the following functions. In each case state an interval of 
validity for the series. 


@ (2) = — 


Game (b) f(z) =mn1+2) 
Solution 


(a) Differentiating the series for 1/(1+ x) gives 


Sage Ee a 5 tore = Il <ae< il. 
Multiplying both sides by —1 gives the required Taylor series, 
Tria torn ar Ie 
(b) Integrating the series for 1/(1+ x) gives 
ln(1 + x) 
=c+r-— $x’ + ir’ — irt ir. 5 itore la 


where c is a constant. 


@. To find the value of the constant c, put x = 0 in the equation 
above and solve the resulting equation for c. © 


Taking x = 0 gives ln 1 = c, soc = 0. Therefore the required 
Taylor series is 


ln(1 + x) 
Se patga gatti, for —1 <a. 


An alternative way to find the Taylor series in Example 18(a) is to take 
a = —2 in the binomial series, as was done in Activity 17. 


The result of Example 18(b) is the standard Taylor series for In(1 + x), as 
you'd expect. This shows the connection between this standard series and 
the series for 1/(1 + 2). 


The remaining activities in this subsection require you to differentiate or 
integrate Taylor series. 


Activity 31 Differentiating a Taylor series 


Verify that term-by-term differentiation of the Taylor series about 0 for the 
function f(x) = e” leaves the series unchanged. 


(This result corresponds to the fact that the derivative of e” is e”.) 


Activity 32 Finding the Taylor series about 0 for tan”! x 


In Example 13 you saw that 


— = og og aes, cor =e <1, 
142? 
In Unit 7 you saw that 
d 1 1 
— (tan! x) = —. 
a z) 14+ 2? 


Use integration to deduce the Taylor series about 0 for tan~! x, and state 
an interval of validity for this series. 


Calculating m 


In 1706 the mathematician John Machin used the Taylor series 
about 0 for tan~!, which you were asked to find in Activity 32, to 
calculate the first 100 digits of m. The series is 


tan g= gs- ir’ + ir’ tr +.. : 
An interval of validity for this series is —1 < x < 1, but the series is 
also valid for x = 1. So, since tan`! 1 = iT, we obtain a 


representation for m as four times the sum of a series: 
m=4tan7'1=4(1—4+4-24---). 
Unfortunately it’s not practicable to use this particular series to 


calculate 7 accurately, because 1 is too far from the centre 0 of the 
series. However, Machin discovered the strange-looking formula 


na = 16tan! (4) ZA pi! (35) : 
The values t and z5 are much closer to 0 than 1 is, so relatively few 


terms of the corresponding series need to be evaluated in order to 
calculate tan~! (4) and tan”! (z5); and hence 7, to 100 digits. 
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John Machin was at the University of Cambridge at the same time as 
Brook Taylor, having acted as a private tutor to Taylor beforehand. 
Machin was later a Fellow of the Royal Society and (for 38 years) 
Professor of Astronomy at Gresham College, London. He was also a 
member of the committee to adjudicate the claims of Newton and 
Leibniz to have invented the calculus. 


In the final activity of this unit you’re asked to use both substitution and 
integration to find the first few terms of the Taylor series about 0 for the 
inverse sine function. 


Activity 33 Finding the Taylor series about 0 for f(x) = sin™' x 


In Example 12, the binomial series was used to show that 
1 1x3 ə 1x3x5 3 


=) 1 = TE 
(1+7) 1-38 + zaxa 3x3! x ) 
that is, 
1 
=1—$a+32?— fat. , for -l<a<l. 


(a) By using substitution, find the first four terms of the Taylor series 
about 0 for the function 1/1 — z?. Determine an interval of validity 
for this series. 

(b) You saw in Unit 7 that 

1 
=: 


< (sin! x) = ——— 
l-z 


dg 
By using this fact and integrating the series in part (a) term by term, 
find the first three non-zero terms in the Taylor series about 0 for the 
function f(x) = sin~! z, explicitly evaluating the coefficients. State an 
interval of validity for this series. 


Learning outcomes 


After studying this unit, you should be able to: 


e find Taylor polynomials about particular points for particular 
functions 


e use Taylor polynomials to find approximations for values of functions, 
estimating such values to a particular accuracy 


e find Taylor series about particular points for particular functions 


e use known Taylor series to find further Taylor series by substitution, 
addition, subtraction, multiplication, differentiation and integration, 
and deduce intervals of validity for such series from intervals of 
validity for the original series. 


Solutions to activities 


Solution to Activity 1 
(a) Since f(0) = cos0 = 1, the constant Taylor 


polynomial about 0 for f(x) = cosa is p(x) = 1. 


The approximation is 1 in each case; that is, 
p(0.01)=1 and p(0.1)=1. 

The corresponding remainders are, to five 

decimal places, 


cos(0.01) — p(0.01) = cos(0.01) — 1 = —0.00005, 


cos(0.1) — p(0.1) = cos(0.1) — 1 = —0.005 00. 
(b) Since f(1) = In1 = 0, the constant Taylor 
polynomial about 1 for f(x) = ln zv is p(x) = 0. 
The approximation is 0 in each case; that is, 
p(1.01)=0 and p(1.1) = 0. 


The corresponding remainders are, to five 
decimal places, 


In(1.01) — p(1.01) = In(1.01) — 0 = 0.009 95, 
In(1.1) — p(1.1) = In(1.1) — 0 = 0.095 31. 


Solution to Activity 2 
(a) We have f(x) = sinz, so 
f'(x) = cosg. 
Hence 
f(0)=sin0=0 and f'(0)=cos0=1. 
Thus the linear Taylor polynomial about 0 for 
the sine function is 
p(x) = f (0) + f'(0)x 
=O0O+2 
=i: 
(b) The linear Taylor polynomial gives the 
approximations 
p(0.25) = 0.25 and (0.5) = 0.5. 
A calculator gives (to 4 d.p.) 
sin(0.25) = 0.2474 and sin(0.5) = 0.4794. 
Hence the two remainders are 
sin(0.25) — 0.25 = —0.0026, 
sin(0.5) — 0.5 = —0.0206. 


The magnitude of the remainder is about 8 
times larger at x = 0.5 than it is at x = 0.25. 


Solutions to activities 


Solution to Activity 3 
(a) We have f(x) = cosx, so 
f'(z) = —sinz. 
Hence 
f(0)=cos0=1 and f'(0)=-—sin0 = 0. 
Thus the linear Taylor polynomial about 0 for 
the cosine function is 
p(x) = f(0) + f'(0) 
=1+02 
= 1, 

(b) The approximation for cos(0.2) given by the 
linear Taylor polynomial p is p(0.2) = 1. To 
four decimal places, the remainder is 

cos(0.2) — 1 = —0.0199. 


Solution to Activity 4 
(a) We have f(r) = (1+ 2)'/?, so 
f(a) = 3042). 
Hence 
fO=020)" =1 
and 
f'(0) = §3(1 +0)? = 5. 
Thus the linear Taylor polynomial about 0 for 
f(z) = (14+ x)! is 
p(x) = f(0) + f'(0)x 
=1+ iz, as required. 
(b) We have 
V1.01 = (1 + 0.01)" = f (0.01). 
The corresponding approximation for v1.01 is 
p(0.01) = 1 + 4 x 0.01 = 1.005. 
To six decimal places, the remainder is 
(0.01) — p(0.01) = 1.004.988 — 1.005 
= —0.000 012. 


Solution to Activity 5 
We have f(x) = e”, so 


faj = er 
Hence 
f()=e'=e and f'(1)=e' =e. 
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Thus the linear Taylor polynomial about 1 for 
f(x) = e” is 
p(x) = f1) + FA) — 1) 
=et+e(x—-1) 


= eT. 


Solution to Activity 6 


(a) We have f(x) = cos z, so 


f'(x)=-—singx and f”(x)= -— cosz. 
Hence 
f(0)=cos0=1, f'(0)=-—sin0=0 


and 

f” (0) = — cos 0 = —1. 
Thus the quadratic Taylor polynomial about 0 
for the cosine function is 


p(x) = f(0) + f'(0)z + af" (Ox? 
=14+02 - $27 
== $x", 
(b) The corresponding approximation to cos(0.2) is 
p(0.2) = 1 — $(0.2)? = 0.98. 
To six decimal places, the remainder is 
cos(0.2) — p(0.2) = 0.980 067 — 0.98 
= 0.000 067. 


This remainder has much smaller magnitude 
than that found in Activity 3(b), so the 
approximation to cos(0.2) by using the 
quadratic Taylor polynomial p(x) = 1 — ix is 
much better than the approximation by using 
the linear Taylor polynomial p(x) = 1. 


2 


Solution to Activity 7 
We have f(x) = sin z, so 


f'(x)=cosx and f”(x)=-—sinz. 
Hence 
f(0)=sin0=0, f'(0)=cos0=1 


and 

f” (0) =—sin0 = 0. 
Thus the quadratic Taylor polynomial about 0 for 
the sine function is 


p(x) = f(0) + f'(x + 5 f"(0)x? 
=0+2+0z2" 


= T. 
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Solution to Activity 8 
We have f(x) = e”, so 


Faj=e and F= 
Hence 

fa)=et =e, fA)=et=e 
and 

a =e. 


Thus the quadratic Taylor polynomial about 1 for 
f(x) = e” is 
p(x) = f(1) + re- ae) 
=e+e(z—1)+ e(z — 1}. 
(By multiplying out the squared brackets and 
collecting like terms, this can also be written as 
ple) = se (1 + x’) .) 
Solution to Activity 9 
(a) To find the quartic Taylor polynomial about 0 


for f(x) = cosx, we need to evaluate f(0), 
f'(0), f’(0), f (0) and f (0). We have: 


f(x) =cosa,  f(0)=1; 
f(a) =—sine, f'(0)=0; 
f"(x) =—cosx, f"(0) = —1; 
fO(x)=sinz, f®(0) =0; 
f(x) =cosx, f“(0)=1. 


Hence the quartic Taylor polynomial about 0 

for the cosine function is 
"(0 (3) 0 

p(x) = F0) + Oe + LO? + 8 


(b) Similarly, to find the quartic Taylor polynomial 
about 0 for f(x) = sin z, we need to evaluate 
f(0), £'(0), £(0), f (0) and f (0). We have: 


f(x) = sinz, f(O) =0; 
f'(x) = coax, fF O=i; 
f”(x)=-—singz, f”(0)= 0; 
Pej =—-cosz, FO) =-1; 
f(z) = sing, f®(0)=0. 


Hence the quartic Taylor polynomial about 0 for 
the sine function is 


(3) (9 
p(x) = f(0) + f'(O)a+ £"(0) 2 45° 0) 3 


2! 3! 
fOO 4 
+ ri T 
1 
=g- >r’ 
3! 
=g— ir”. 


(You may have noticed that the quartic Taylor 
polynomial about 0 for the cosine function contains 
terms in even powers of x only, whereas that for the 
sine function contains terms in odd powers of x only. 
This property is explained later in the subsection.) 


Solution to Activity 10 
(a) (i) Applying the chain rule gives 


= (-k)(1— 2)! x È(1 - 2) 


= (—k)(1— 2)" x (-1) 
= k(1 — x) (t+) 
B k 
-TT 
as required. 
(ii) Applying the result from part (a)(i), 
with k = 1, 2,3, gives 


fe)= r=, FO) =1; 
rosg Os 
razg "O 
f(2) sr 
= (0) =3! 


Solutions to activities 


(iii) Hence the cubic Taylor polynomial about 0 


for f is 
kd 
0 
pla) = £0) + Oje + LO? 
fOO) a 
T 3] x 
3! 5 
SEPET 2g tar 
=1+r+r? +r. 


(b) (i) The 4th derivative of f(x) is 


$9) = 2 (Gam) 


_ 4x 3! = 4! 
o (l=) (L-z)? 
the 5th derivative is 


d 4! 
few )= dx (z — =) 
o 5x4 | 5! 


{isg (=r 
and so on. The pattern is now clear; 
the nth derivative will be 


n! 
fre) = T ay 


(ii) By putting x = 0 in the formula for 
f™ (x) above, we obtain 


f= 


n! 
(1 — 0)r+1 


(iii) Hence the Taylor polynomial of degree n 
about 0 for f is 


ple) = f0) + FOr + 
an y 


=n. 


(0), 


=l+aotartait--- +a". 
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Solution to Activity 11 


To find the cubic Taylor polynomial about 7/6 for 
f(x) = sina, we evaluate f(7/6), f’(7/6), f’’(7/6) 
and f‘) (1/6), as follows: 


f(x) = sinz, 
J (2) = cose, 
Pa)= 
fO(z) =- cosg, fO (=) = —-3/3. 


Hence the cubic Taylor polynomial about 7/6 for 
the sine function is 


pa) = F(5) +r (a) (2-5) 


—sin a, 


Solution to Activity 13 
Using the given a a eas 


Pr(& jerata ot cab pe pa” 


3! no? 
and calculating wie to six decimal places, we 
obtain 


pı(—0.05) = 1 + (—0.05) = 0.95 


p2(—0.05) = pı (—0.05) + $(—0.05)? 
= 0.95125 

p3(—0.05) = p2(—0.05) + ¢(—0.05)* 
= 0.951229 (to 6 d.p.) 

pa(—0.05) = p3(—0.05) + 4 (—0.05)“ 
= 0.951229 (to 6 d.p.). 


The values of p3(—0.05) and p4(—0.05) agree to six 
decimal places, so it is likely that 


e7005 = 0.9512 
to four decimal places. (This is indeed the case.) 
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Solution to Activity 14 


Using the given Taylor polynomials, and calculating 
values to eight decimal places, we obtain 


po(0.2)=1 


1 
p2(0.2) = 1 — F = = 0.98 


pa(0.2) = po(0.2) + a L oo. 2)* = 0.980 066 67 

pe(0.2) = pa(0.2) — Al  (0.2)8 = 0.980 066 58 
1 

pg (0.2) = ps (0.2) + a (0.2) = 0.980 066 58. 


The values of pe(0.2) and ps(0.2) agree to eight 
decimal places, so it is likely that 
cos(0.2) = 0.980 067 


to six decimal places. (This is indeed the case.) 


Solution to Activity 15 


(a) From the solution to Activity 9(a), we can see 
that the values of f™ (0) form the repeating 
sequence 

Oat 0e 


Hence the Taylor series about 0 for f(x) = cos x 
is 


Similarly, from the solution to Activity 9(b), we 
can see that the values of f‘(0) form the 
repeating sequence 

0,1,0, —1,0,.... 
Hence the Taylor series about 0 for f(x) = sin x 
is 


Solution to Activity 16 


We proceed initially in a similar way to the solution 
to Activity 9(b). To find the Taylor series about 0 
for f(x) = sin z, we need to evaluate f(7/2), 
f'(m/2), f(n/2), FO (1/2), .... We have: 


2 
f'(z)=-sinze, f” (5) =e 
f(e) =- cosg, fO (5) = 
f%a)=sine, fO (5) = 4, 


The values of f™ (1/2) form the repeating sequence 
1,0, —1,0,1,.... 

Hence, from formula (5), the Taylor series 

about 7/2 for f(x) = sin z is 


1 1 2 are myt. J 1T\ 6 
TEEPEE 


1 T\8 
T 


Solution to Activity 17 
Since 1/(1 + x)? = (1+2)~?, we take a = —2 
in the binomial series, to give 


1 (-2)(-3) a 
2! 


+ 


= 1 + (—2)x + 


(+7)? | 
DEIN sy.. 


= 1 — 2g + 3x? — 4r? + 5a? oe, 
This Taylor series is valid for —1 <a < 1. 


Solution to Activity 18 


Taking a = 4 in the binomial series gives 


ICD 9 HCDCD o 


(ay Slot +4 31 z 
1 1 3 5 
HEDEDCD a 
1 1x3 
— 1 2 j 3 
= tage pao” TF asi 
1x3x5 4 
T xa 7 


This Taylor series is valid for —1 <a < 1. 


Solution to Activity 19 


The cubic Taylor polynomial about 0 for the 

function f(x) = In(1 + x) is obtained from the 

Taylor series for In(1 + x) by deleting all the terms 
1 


after iz’, to give 


palz) =z- 4x7 + ir’. 


Solutions to activities 


Solution to Activity 20 


Using the series from the solution to Activity 18, 
we obtain, to five decimal places, 


pı(0.1) = 1 + $(0.1) = 1.05 


1 
p2(0.1) = pı (0.1) — EEI (0.1)? = 1.048 75 


9 
1x3 
1) = .1) + —— (0.1)? = 1.048 81 
p3(0.1) = p2(0.1) + 535; (0-1) 
1x3x5 4 
l= 1) — ——— (0.1) = 1.048 81. 
pa(0.1) = p3(0.1) - Spay (0-1) 048 8 


The values of p3(0.1) and p4(0.1) agree to five 
decimal places, so it is likely that 


v1.1 = 1.049 


to three decimal places. (This is indeed the case.) 


Solution to Activity 22 


(a) The Taylor series about 0 for 1/(1 — x), from 

page 142, is 

JF hy he a, , 

l=g 
for —1 < x < 1. Replacing each occurrence of x 
by —2x and using the fact that 
1/(1 — (—2x)) = 1/(1 + 2x) gives 

1 


T a ae ee) eee 


= 1 — 2g + 4r? — 8r? +. 
This is the Taylor series about 0 for 1/(1 + 2x). 
It is valid for —1 < —2x < 1; that is, for 
—4 < T< 5. 
(This part could also be answered by taking the 
binomial series from page 142 for (1+ x)“, with 
qa = —1, and then replacing each occurrence 
of x by 2z.) 
(b) The Taylor series about 0 for In(1+ 2), from 
page 142, is 
In(1 +x) =a — $2? + ir’ — 


for -l<a<l. 


1,4 
qr EES 


Replacing each occurrence of x by —x and using 
the fact that In(1 + (—x)) = ln(1 — x) gives 


In(1 — zx) 


This is the Taylor series about 0 for In(1 — x). 
It is valid for —1 < —a” < 1; that is, for 
=I << 
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(c) The Taylor series about 0 for In(1+ x), from 
page 142, is 
In(L+2) =x $27 + $2°-—Gatt---, 
for —1 < x < 1. Replacing each occurrence of x 
by 32 gives 


In(1 + 3x) 

= (3x) — 3(3x)? + 3(3x)° — 7(3x)* + 
3? 33 34 

samat tgrang te 


This is the Taylor series about 0 for In(1 + 32). 
It is valid for —1 < 3x < 1; that is, for 
—+ <T< $. 


(d) The Taylor series about 0 for e”, from page 142, 


is 
1 1 
Cel pat tT te for x € R. 
Replacing each occurrence of x by x3 gives the 
Taylor series about 0 for e7”: 


1 44x90 
x) + 3 
1 


1 
— 3 6 9 S 


1 


e™ =1+ (z8) + (H 


Solution to Activity 23 


The Taylor series about 0 for 1/(1 + x) is given as 
1 

Lg 
for —1 < x < 1. Replacing each occurrence of x by 
x — 1 in this equation, we obtain 

1 

1+ (2-1) 

= 1 — (z—1) + (z—1})}? Se)? +- 
But 1/(1 + (x — 1)) = 1/x. Therefore the Taylor 
series about 1 for 1/z is 

1 


Pamela ia ek et ae 


This Taylor series is valid for —1 < x — 1 < 1; that 
is, forO < xz < 2. 


=1-r +g rt, 


Solution to Activity 24 
In each part, we use the Taylor series about 0 for 
1/(1 + x), from page 142, which is 

1 


1l+a 
for -l<a<l. 


=1l-at+a?—-a2+--, 
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for x € R. 


(a) Replacing each occurrence of x by $a in this 
equation, we obtain 


1 2 3 
rpg =} Ge) + Ge)” - Gay to 


— 1-2 4,2 _ 8734 .., 
=1- 32+ 52 27% t : 


It follows that the Taylor series about 0 for 
g(x) = 3/(3 + 2x) = 1/ (1 + 32) is 
3 
3+ 22 
This Taylor series is valid for —1 < 2x < 1; that 
is, for -3 SUE 3. 


— {2 4,2 8734... 
=1= 38+ 5 ae + i 


(b) Replacing each occurrence of x by 2(x + 1) in 
the series for 1/(1 + x), we obtain 
1 


1+2(¢+1) 


= 1 — (2(x + 1)) + (2(z + 1)? — (2(z +1)’ +- 


=1—2(x+1)+4(£ +1) —8(£ +1) 45, 
It follows that the Taylor series about —1 for 
1/(3 + 2x) = 1/(1 + 2(a@ + 1)) is 


1 
=1-—2 1)+4 1)? 
TET (x+1)+4(x+1) 


— 8(z +1) +- 
This Taylor series is valid for 
—1 < 2(x + 1) < 1; that is, for ~ < £ +1 < å 
or equivalently os [E< —t. 


Solution to Activity 25 


Using the given Taylor series about 0 for In(1 + x) 
and In(1 — x), we obtain 


la(1 + zx) —In(1- zx) 


as (g — lg? p lg — igt p ig —... 
= (x zL + 3h qv + 5a ) 
Lage 13 1,4 15 
(“a= ge = ga ea g e] 


= 2g + 209 + 20° +; 
that is, 
1+2 
n(**) = 2g + r? + 20° +o. 


The Taylor series for In(1 + x) and ln(1 — x) are 


each valid for —1 < x < 1, so the Taylor series 
derived here is also valid for —1 < a < 1. 


Solutions to activities 


Solution to Activity 26 Therefore 
If t = 1.5, then the corresponding value of x is Le =g j= 1 idene x 4 z? i zt irre 
2 ee 2 2 3 4 
1541 25 5 | i a2 go gf 
Using the series given, we obtain, to six decimal TF 2 3l + A 
places, la ? 
pi(0.2) = 2 x 0.2 = 0.4 Sikt at Oy 
p3(0.2) = pı (0.2) + 2(0.2)* = 0.405 333 for x € R; that is, 
1 
ps(0.2) = p3(0.2) + 2(0. 2)° = 0.405 461 sinha = z + = z =r’ tia +---, forzeR. 
0.2 0.2) + 2(0.2)" = 0.405 465 : 
Ee) 2) as ) Solution to Activity 29 
po(0.2) = p7(0.2) + (0.2)? = 0.405 465. 


(a) Using the Taylor series about 0 for sin x, from 


The values of p7(0.2) and po(0.2) agree to pace 149° we obtain 


six decimal places, so it is likely that 


1 iL 
I 2 2 = r? 3 ea 
MGS) Sip) anda alae (< rog = 
1— 0.2 
to four decimal places. (This is indeed the case.) = 7 — 2° e+ z eg’ —---. frreR. 
3! 5! , 
Solution to Activity 27 (b) Using the Taylor series about 0 for cos x, from 
The given Taylor series is page 142, we obtain 
1 2 3 1 1 4 
mea es omens rae a” ta 
for -l<a<l. 1 
: ; : ? =i ia — 
Using this series and replacing each occurrence of x 2! 4! 
1 . 
by 32 gives a ae 
i 1 1 aA nr il aor le 
(3+)? 32 (1+ 32)? 
1 
1 1 f L2 1,,\3 = (1 = x + =" = ) 
~ 39 (1 2 (30) +3 (37 4 (30) + ) 2! 4! 
—_ 1t 2_4 3 l s, 1 
32 (1 Cr aot 3a + +) +(e- 52 + a" = 
1 2 32 43 1 1 1 1 
ee a Seg ee pees = 2 3 4 5 
ge ptg gT ange a gyn 
This Taylor series is valid for —1 < ir < 1; that is, for eR. 
for =3 < 2 <3. : a 
Solution to Activity 30 
Solution to Activity 28 Using the Taylor series about 0 for 1/(1 + x) and 
We use the formula sin xz, and ignoring all terms that lead to 4th or 
Soe = 1 (ee —e-*), higher powers of x, we obtain 
The Taylor series about 0 for e” is — =(l-¢+27- (2 = La m -) 
2 3 4 +r 3! 
@al+et rtr i rie for x € R. 1 
On replacing each occurrence of x by —x, we obtain 3! 
2 3 4 
Paap, wrger TH+) Pte) 
3! A! fo ee 
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Hence the cubic Taylor polynomial about 0 for 
f(z) = (sinz)/(14+ x) is 
p(z) = x- x? + 32°. 


Solution to Activity 31 


The ae series aan 0 for e” is 
1 

3] z? + T zt +. 

Dieting this series gives 


3 4 
ata Egr T: 


=] +7 +5 a a + a x +- 
which is aso same series, as seit 


l+o+5 24i 


TEER 


Solution to Activity 32 


We have 
1 
1 +z? 
for —1 < x < 1. Integrating both sides of this 
equation gives 


|= gar = | (L-a +at— a +) da 


that is, 
—1 


=1- g? + rt- r+, 


tan TETE T TETE 


for —1 < x < 1, where c is a constant. Taking x = 0 


gives tan™1 0 = c, so c = 0. Therefore 


tant g = g ġa? + ir? drl to, 


for -l<a<l. 


170 


Solution to Activity 33 


(a) 


The given series is 
1 
vl+2 
for —1 < x < 1. Replacing each occurrence of x 
by —2? gives 


=1-jr+ su 2 Žr’ mes 


=14 fa? + 324 Ša 
This Taylor series is valid for —1 < —a? < 1. 
The left-hand inequality here is —1 < —2?, 
which is equivalent to 1 > x?; that is, 
—1 <x < 1. The right-hand inequality is 
—x? < 1, which is equivalent to x? > —1 and 
therefore does not place any restriction on z, 
since the square of any real number is 
non-negative. Thus this Taylor series is valid for 
-l<a<l. 


Integrating both sides of the equation above 
gives 

1 tO, 3,4 À 
that is, 


sin ‘g=c+a+jar+3 Pee 


for —1 < x < 1, where c is a PEPIN Toe 

x = 0 gives sin™! 0 = c, soc = 0. Therefore 
sint g = r + ir? + pat... j 

for -1 <z <1. 


=1- }(-2?) + $O) - A + 
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Unit 12 


Complex numbers 


Introduction 


Introduction 


As you know, every positive real number has two square roots, one positive 
and one negative. For example, the positive real number 9 has square 
roots 3 and —3. Another way to express this fact is to say that the 
equation 


r =9 


has solutions 3 and —3. Suppose now that you want to find a square root 
of a negative number, such as —1. That is, you want to solve the equation 


r? =—l1. 


You may believe that this equation has no solutions; after all, the square of 
any real number is positive or zero. In this unit you'll learn about a system 
of numbers, known as the complex numbers, in which —1 and all other 
negative numbers have square roots. 


The complex numbers are created by first introducing a new number, 
written as i, with the property that i? = —1 (so i is a square root of —1). 
It may seem like cheating to simply define 7 in this way, but you’ll see that 
the resulting new system of numbers is incredibly powerful and useful. The 
other complex numbers are created by multiplying i by any real number, 
and then adding any real number. For example, the following are complex 
numbers: 


344i, —V/2+99i and 0.7—7i. 


Each real number can also be considered to be a complex number 
(3 is 3+ 0i, for example). The number 7 is a solution of the equation 


x? = —1, and there’s a second solution of this equation, namely —i. 


The first publication to include a reference to complex numbers was 
the book Ars Magna (1545) by Gerolamo Cardano (1501-1576). The 
possibility of using complex numbers first emerged when Italian 
mathematicians were developing methods for solving cubic equations, 
such as x? + z? + 6x +3 = 0. Both Scipione del Ferro (1465-1526) 
and Niccolò Fontana Tartaglia (1499/1500-1557) independently 
discovered how to solve any cubic equation, using methods that 
sometimes involve complex numbers. Tartaglia revealed his method in 
secret to Cardano, who later published it in his Ars Magna. This 
angered Tartaglia, who insulted Cardano for revealing the method. 
Cardano, in his defence, claimed to have also seen del Ferro’s method, 
which was unpublished, and so he no longer felt obliged to keep the 
method of solving cubic equations secret. 


Niccolo Fontana Tartaglia 


At first, complex numbers may seem abstract, because they don’t obviously 
represent physical quantities in the way that real numbers do. However, 
they’re of fundamental importance in mathematics — as you'll begin to see 
in this unit — and they’re an essential tool in many scientific disciplines, 
such as electromagnetism, fluid dynamics and quantum mechanics. 
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Figure 1 A transistor 


Figure 3 A fractal in the 
complex plane 
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Quantum mechanics, for instance, is about the motion of very small 
objects, such as atoms. The foundational equations of the subject involve 
complex numbers. 


Discoveries in quantum mechanics led to the development of the modern 
transistor, midway through the last century. Transistors (one is shown in 
Figure 1) are devices used to control current in circuits, and are an 
essential part of electronic systems, such as those found in cars, computers 
and portable media players. 


It’s instructive to think of complex numbers geometrically, using the 
complex plane, which is a plane such as that shown in Figure 2. Each 
complex number is represented by a point on the plane. For instance, the 
complex number 2 + 32 is represented by the point with coordinates (2,3). 
You'll learn more about the complex plane in Section 2. 


eet 3 


Figure 2 The complex plane 


In higher-level modules involving complex numbers you can find out how 
some simple formulas involving complex numbers give rise to fractals in 

the complex plane, which are intricate shapes with repetitive structures, 

such as that shown in Figure 3. 


Not only do complex numbers have fascinating geometric properties that 
give rise to beautiful fractals, but the system of complex numbers also has 
some useful algebraic properties that the system of real numbers lacks. For 
instance, you saw in Unit 2 that, if you’re working only with the real 
numbers, then some quadratic equations have no solutions. An example is 
the equation z? — 4x + 5 = 0: if you write this equation as (x — 2)? +1 = 0, 
then you can see that for every real number zx the left-hand side is greater 
than or equal to 1, and hence the equation has no solutions that are real 
numbers. You’ll see in Section 3 that if you’re allowed to use complex 
numbers, then every quadratic equation has at least one solution. In fact, 
you'll meet an even stronger result: if you’re allowed to use complex 
numbers, then every polynomial equation has at least one solution. A 
polynomial equation is an equation of the form ‘polynomial expression = 0’ 
(where the polynomial expression has degree at least 1), such as 


5z +8=0 or a’ — 132° + 34° -2=0. 


1 Arithmetic with complex numbers 


In Section 4 you’ll be shown Euler’s formula, 

e? = cos + isin 8. 
This is a hugely important equation that relates the exponential function 
to the trigonometric functions. Euler’s formula allows you to write 
complex numbers in a helpful manner, called exponential form, which is 
widely used in mathematics, engineering and other scientific subjects. 


Some of the blue boxes in this unit give you a taste of some of the ways in 
which complex numbers are used in higher-level mathematics. Remember 

that these boxes are only for your interest; you won’t be assessed on their 

contents. 


1 Arithmetic with complex numbers 


In this section you'll learn the details of what complex numbers are, and 
how you can add, subtract, multiply and divide them. You'll also learn 
about another arithmetic operation, called complex conjugation, which is 
particular to the complex numbers. 


1.1 What are complex numbers? 


To define the complex numbers we start by considering the equation 
2 
a =-l. 


This equation has no solutions that are real numbers, because the square 
of any real number is non-negative. To overcome this problem, we 
introduce a new number, which we call 7, and declare that 7 is a solution of 
the equation. 


The number i is defined to have the property i? = —1. 


So 7 is a square root of —1. You learned in Section 4 of Unit 1 that 

the symbol v“ is used to denote the non-negative square root of a 
non-negative real number. In other texts you may see i written as /—1, 
even though neither —1 nor 7 is a non-negative real number. This notation 
isn’t used in this module, as it can be misleading. To see why, remember 
the rule Jab = vab, also from Unit 1, which is true when a and b are 
non-negative real numbers. If you try to apply this rule with a = b = —1, 
then you obtain the incorrect statement 


VZIV-I = y(-1) x (-1) = V1 = 1. 
This statement is wrong because /—1./—1 should equal —1, not 1. 


To avoid this kind of pitfall, it’s best not to use the notation y —1, except 
in certain particular circumstances, such as those described in 
Subsection 3.1, where you’ll meet expressions involving + 
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Let’s now look at how the complex numbers are obtained by combining 
this new number 7 with the real numbers. For instance, you can multiply i 
by a real number such as 5 to give another number, 52. You can then add 
5i to a real number such as 3 to give another number, 3 + 5i. The complex 
numbers are all the numbers that you can obtain in this way. 


Complex numbers 


A complex number is a number of the form a + bi, where a and b 


are real numbers. 


The set of all complex numbers is denoted by C. 


For example, —7 + 3i and 3 + 192 are complex numbers. Using the usual 
conventions of algebra, we can write some complex numbers in a form 
simpler than a + bi. For example, we write 


45+(-1l)i as 


341i as 
6 + 01 as 
0+ (-1)i as 
0+ 01 as 


45 —1li 
3+i 

6 

=4 


0. 


Note that since 7 is a square root of —1, the number —7 is also a square 
root of —1, as you’d expect. You'll see this confirmed later. 


1 Arithmetic with complex numbers 


In the rest of this section, you’ll meet some of the basic properties of 
complex numbers, and practise manipulating them. To start with, here are 
two important definitions. 


Real and imaginary parts 


For any complex number z = a + bi, the real number a is called the 
real part of z and the real number 0 is called the imaginary part 
of z. We write 


Re(z)=a and Im(z)— 6: 


For example, 

Re(—7+ 3i)= -7 and Im(—7+ 3i) =3. 
Also, 

Re(5i) =0 and Im(5z) =5. 


Notice that it is b, and not bi, that is the imaginary part of a + bi. For 
example, the imaginary part of —7 + 3i is 3, not 37. 


If the real part of a complex number is 0, then the complex number is 
sometimes called an imaginary number or a purely imaginary 
number. For instance, the complex numbers 57 and —i are imaginary 
numbers. 


If the imaginary part of a complex number is 0, then that complex number 
is in fact a real number. For instance, the complex number 5 (which you 
could write as 5 + 07) is also a real number. In the same way, every real 
number is a complex number. This implies that the set of real numbers is 
a subset of the set of complex numbers; that is, R C C. 


Activity 1 /dentifying real and imaginary parts 


Write down the real and imaginary part of each of the following complex 
numbers. 


(a) 249 (b)4 (e) -7i (d)0 (ei (f) 1-3 


You can also write complex numbers in the form a + ib, which is equivalent 
to a+ bi, and both forms are used. Your choice may depend on the nature 
of the real number b. For instance, you may prefer to write 1 + i/2 rather 
than 1 + V2i, to avoid possible confusion with 1 + Vi. 
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Francis Maseres (1731-1824) 
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Over the centuries, mathematicians have struggled with definitions of 
numbers. Few people have trouble understanding the natural 
numbers 1,2,3,..., but the concept of zero was more troublesome. 
One of the first people to bring the idea to Europe was Fibonacci 
(Leonardo of Pisa) in his Liber Abaci (1202), the text that spread the 
Hindu—Arabic numeral system through Europe. Even so, the 

symbol 0 was not widely used in Europe until the seventeenth 
century, and in 1759 the English mathematician Francis Maseres 
wrote of the negative numbers that they 


darken the very whole doctrines of the equations and make dark 

of the things which are in their nature excessively obvious and 

simple. 
Likewise, many mathematicians at first doubted the validity of 
complex numbers, which Cardano had described in Ars Magna as 
‘some recondite third kind of thing’. In fact, the renowned French 
philosopher and mathematician René Descartes (1596-1650) coined 
the term imaginary numbers for real number multiples of i because he 
considered them to be illusory. 


Today we’re aware of the many uses of different types of numbers, 
and the foundations of mathematics are well established, so there’s no 
longer any doubt about the validity of zero, negative numbers or 
complex numbers. 


1.2 Adding and subtracting complex numbers 


You can add or subtract complex numbers by adding or subtracting their 
real and imaginary parts separately. For example, 


(5+6i)+ (3+2) = (5+3) + (642)i =848i, 
y —— 


and 


add real add imaginary 
parts parts 
(5+6i)—(3+2i)= (5-3) + (6-2)4 =2+4i. 
— —— 
subtract subtract 
real parts imaginary parts 


In essence, to add or subtract complex numbers you treat 7 as a variable 
and add or subtract in the normal way. 


Example 1 Adding and subtracting complex numbers 
Let z = 7 + 19i and w = 13 — 107. Work out z+ w and z — w. 


1 Arithmetic with complex numbers 


Activity 2 Adding and subtracting complex numbers 


Work out z + w and z — w for each case below. 
(a) z=2+5i, w=-7+ 13i (b) z= —4i, w= -9i 
(c) z=3-Ti, w=3-—īi (d) z=3+7i, w=3-Ti 


(e) z=- 4i, w=-}+4i (f) z=1.2, w=3.4i 


Many familiar rules for adding and subtracting real numbers also apply to 
complex numbers. For instance, the order in which you add two complex 
numbers z and w doesn’t matter: 


Z+w=wt+Z. 
Also, any three complex numbers u, v and w satisfy 
(utv)+w=ut(ut+w). 


You’ve met similar rules before, when you learned about addition of 
vectors, in Unit 5, and addition of matrices, in Unit 9. Recall that an 
operation (in this case, addition of complex numbers) that obeys the first 
rule is said to be commutative, and an operation that obeys the second 
rule is said to be associative. Together the two rules tell you that you can 
add several complex numbers in any order that you choose. 


The number 0 has the same role in the arithmetic of complex numbers as 
it does in the arithmetic of real numbers, in that adding 0 to any number 
leaves that number unchanged. 


You should approach the next activity using the usual rules of algebra, 
remembering to treat 7 like a variable. 


Activity 3 Adding and subtracting several complex numbers 


Let u = 4 + 62, v = —3 + 52 and w = 2 — i. Work out the following. 
(a) u+v+w (b) w+v+u (c) u— (v+ w) (d) u— (v — w) 
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1.3 Multiplying complex numbers 


You can multiply two complex numbers using the usual rules of algebra 
and the fact that i? = —1, as shown in the next example. 


As usual, you can also write products using the x symbol. For instance, 
the product (3 + 27)(5 + i) can also be written as 


(3 + 2i) x (5 +å). 


Activity 4 Multiplying complex numbers 


Find the following products of complex numbers. 
(a) (1 +3i)(2 + 4i) (b) (—2 + 3i) (4 — 7i) (c) 3i(4— 5i) 
(da) 7(—2+ 5i) (e) (2-3)(2+3i) ®© (4 +0) (+4 


Multiplication of complex numbers, like addition of complex numbers, is 
both commutative and associative. This means that, for any complex 
numbers z and w, 


zZXw=wx Zz, 
and, for any complex numbers u, v and w, 


ux (v x w) = (u x v) x w. 
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Multiplication of complex numbers is also distributive over addition of 
complex numbers; that is, for any complex numbers u, v and w, 


ux (vt+w)=uxvtux w. 


When you add and multiply real numbers, you probably apply the 
commutative, associative and distributive laws without thinking about 
them. You should be comfortable doing the same for complex numbers. 


The numbers 0 and 1 play the same roles in the multiplication of complex 
numbers as they do in the multiplication of real numbers. That is, 
multiplying a number by 0 gives the answer 0, and multiplying a number 
by 1 leaves that number unchanged. 


Activity 5 Adding and multiplying several complex numbers 


Let u = 1 + 2i, v = 4 — 3i and w = —i. Work out the following. 
(a) u(v +w) (b) uv + uw (c) www 


When multiplying a complex number z by itself, you should write z?, 
rather than zz or z x z, just as for real numbers. Other positive integer 
powers of a complex number z, such as z? or z100, are defined in the usual 
way. When z is not zero, the zeroth power z° of z is defined to be 1, just 
as for real numbers. You'll learn about negative integer powers later on. 


The index laws from Section 4 of Unit 1 continue to hold when the base 
numbers are complex numbers and the powers are positive integers. For 
example, one of these laws for complex numbers states that, if z and w are 
complex numbers and n is a positive integer, then 


(20)? =g" wr 


In fact, once you learn about negative powers you'll see that these laws 
hold when the powers are any integers (not necessarily positive integers). 


Activity 6 Working out powers of i 


Work out i°, il, i7, i3, if, and if. Predict the pattern that would emerge 
if you continued to work out higher powers of i. 


As you learn more about complex numbers, you'll come to see that the 
system of complex numbers has richer properties than the system of real 
numbers. One example of this is in taking square roots of numbers. 
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Gauss in his prime 
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You know that each positive real number has two square roots. For 
example, the number 3 has the two square roots +3. If you’re working 
with the complex numbers, then it’s also true that each negative real 
number has two square roots. For example, the number —3 has the two 
square roots +i/3, because 


(iv3) =; (v) ne ee 


an (-iv3)” = (ix (-v3)) =? (-v3) fi) a 


As you'll see later, these are the only two square roots of —3. In general, 
we have the following useful fact. 


Square roots of a negative real number 


If d is a positive real number, then the square roots of —d are tiVd. 


For example, the square roots of —4 are +iv/4, that is, +2i. 


Activity 7 Checking the square roots of a negative number 


Show that 3i and —3i are both square roots of —9. 


Gaussian integers 


As you know, an integer p greater than 1 whose only integer factors 
are +1 and +p is called a prime number, or just a prime. The first 
few primes are 


Panes el Peal ta Meera 
Notice that even though 2 is a prime, 
Ce) i=) = 1S = 


This shows that 1 +i and 1 —7 are both factors of 2, so if we allow 
not only integer factors but also factors that are complex numbers 
whose real and imaginary parts are integers, then 2 is no longer a 
prime! A complex number a + bi for which a and b are integers is 
known as a Gaussian integer, after the German mathematician 
Carl Friedrich Gauss (1777-1855), who first developed them. 


Just as any ordinary integer x can be factorised in two ‘trivial’ ways 
as © = 1 x x and x = (—1) x (—2), so any Gaussian integer z can be 
factorised in four trivial ways as 


2—=1x%2—=(—1) x (—2) S=7 =< C22) = (2) x ee). 
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If these are the only ways in which a Gaussian integer z can be 
factorised as a product of two Gaussian integers, then z is called a 
Gaussian prime. You’ve seen that 2 is not a Gaussian prime, and 
neither is 5, because 

(2+7%)(2—7%) =5. 
However, it can be shown that 3 is a Gaussian prime, as is 1 + i, and 
in fact there are infinitely many Gaussian primes. 


You can learn about the Gaussian integers in more advanced modules 
on number theory. 


Before you find out how to divide complex numbers, it’s useful for you to 
learn about another operation that you can perform on complex numbers, 
called complex conjugation. 


The complex conjugate of a + bi is a — bi. 
The complex conjugate of z is denoted by Z. 


The operation of transforming z to Z is called complex 
conjugation. 


For example, 
if z=-3+2i then Z=-—3-2i, and 
if z=5-34 then Z7=5+ 3%. 
Activity 8 Finding complex conjugates 


Find the complex conjugate of each of the following complex numbers. 


(a) 442i (b) 3-81 (c) 9% (d)5 


Notice that if z is a real number, then Z = z. So complex conjugation has 
no effect on real numbers. 


In the next activity you’re asked to show that to undo complex 
conjugation, you just apply complex conjugation again. 


Activity 9 Applying complex conjugation twice 


Let z = a + bi and w = Z. Show that W = z. 
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There are some more properties of complex conjugation in the box below. 
The last property involves division of complex numbers, which you'll meet 
shortly. 


These properties, and all the other complex number properties stated in 
this unit, hold for all complex numbers for which the expressions in them 
are defined. For example, the fourth property below holds for all complex 
numbers z and w with w 4 0. 


Some properties of complex conjugation 


You can prove these properties by writing z = a+ bi and w = c + di. This 
gives 


at+c)+(b+d)i 
ge E 


In the same way you can show that z — w = Z — W. 


To prove the third property, let’s work out the sides of the equation 
separately. We have 


ZW = (a — bi)(c — dt) 
= ac — adi — bei + bdi? 
= (ac — bd) — (ad + bc)i. 
Also, 
zw = (a+ bi)(c+ di) 
ac + adi + bci + bdi? 
= (ac — bd) + (ad + bc)i, 
which gives 


zw = (ac — bd) — (ad + be)i. 


So both Z W and ZW are equal to (ac — bd) — (ad + bc)i, which implies that 
ZW = ZW. 


The fourth property will be proved in the next subsection, once you’ve 
seen how to divide complex numbers. 


In the next activity you’re asked to prove two further properties of 
complex conjugation. 


1 Arithmetic with complex numbers 


Activity 10 Proving two identities involving complex numbers 


By writing z = a+ bi, prove the following identities. 


(a) z +Z =2Re(z) (b) z- Z = 2Im(z) 


Here’s the property of complex conjugation that’s useful when you want to 
divide complex numbers: whenever you multiply a complex number by its 
complex conjugate, you obtain a real number. To see why this is, recall the 
formula for the difference of two squares from Unit 1: 


(A+ B)(A— B) = A? — B?. 


Given any complex number z = a + bi, we can apply the difference of two 
squares formula with A = a and B = bi to give 


(a + bi) (a — bi) = a? — (bi)? = a? — bi? = a? + 0. 
n 


So we have the useful fact below. 


For any complex number z = a + bi, 


22 =a +b. 


For example, 


+21- 22) = 17 +2 =1+4=5. 


Activity 11 Multiplying complex numbers by their complex 
conjugates 


Multiply the following complex numbers by their complex conjugates. 
(a) 2+ 3i (b) —1— 2i (c) 5i (d) —2 


1.5 Dividing complex numbers 


In this subsection you’ll learn how to divide complex numbers, making use 
of the complex conjugation operation that you met in the previous 
subsection. 


Suppose, for example, that you want to divide 5 + 3i by 1 + 2i. You can 
write the result as 
5 + 31 


1+ 2i 
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To see that this really is a complex number, you can simplify it by 
multiplying the top and bottom of the fraction by the complex conjugate 
of the denominator. For our fraction, the denominator is 1 + 22, so its 
complex conjugate is 1 — 2i. Multiplying the top and bottom of the 
fraction by 1 — 2i gives 

5+3  (54+34)(1—2) 5-10i+3i-67? 11-7 

142i (14+ 2%)(1—2i) 1? + 2? © 5 
You can write this number as 


4(11—7i) or Bi. 


The second alternative has the form a + bi, which shows that it is indeed a 
complex number. However, it’s fine to leave the answer in any of the final 
three forms above. 
Simplifying fractions in this way may remind you of simplifying fractions 
involving surds, such as 

1+5V3 

V2+ V3" 


which you met in Section 4 of Unit 1. To do this, you used a conjugate of 
an expression involving surds, which performs a similar role to the complex 
conjugate. 


1 Arithmetic with complex numbers 


You can check that the answer that you obtain from a division of complex 
numbers is correct by using multiplication. In the example that you’ve just 
seen, for instance, you find that 


(1 + 8i) x (5-4) = (5 + 39i — 817) = 413 + 39i) = 1 + 3%, 
as expected. 


Activity 12 Dividing complex numbers 


Write each of the following fractions as a single complex number. 


1 2% 11 — 8 1 1 

OS tee iy a WS 
4+ 71 8+ 32 —2+ 52 
Oo “aaa “Sas 


It isn’t possible to divide a complex number by 0, just as it isn’t possible 
to divide a real number by 0. 


As for real numbers, if z is a non-zero complex number, then the number 
1/z is called the reciprocal of z. Also, as for real numbers, 


1 1 1 
— is denoted by 27t, =z is denoted by a, =z is denoted by e*, 
z Zz z 


and so on. 


As mentioned earlier, the index laws from Unit 1 hold when the base 
numbers are complex numbers and the powers are any integers, positive, 
negative or zero. For example, the following index laws hold for all 
complex numbers z and w and all integers m and n: 


oa, F7 z2™"” and (zw) = z"w". 
When you’re using complex numbers as base numbers, you should be 
careful to apply index laws only in cases in which the indices (such as m 
and n above) are integers. This is because the definitions of expressions 
such as z!/2, where z is a complex number, are beyond the scope of this 
module. As you'll see in Section 3, all complex numbers other than 0 have 
two square roots, so an expression such as z!/2 is potentially ambiguous 
since it could refer to either of the square roots. 


Finally, as promised, here’s the proof of the fourth property of complex 
conjugates stated in the box on page 186. The property is 


z/w =2/O. 

If w £0, then 
(z/w)xw=2z, so (z/w)xw=z. 

Hence, by the third property in the box on page 186, 
z/w X WZ. 


Dividing both sides by W gives the property stated above. 
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2 Geometry with complex numbers 


In this section you'll learn how complex numbers can be represented by 
points in a plane, called the complex plane, which gives a rich geometric 
way of interpreting the arithmetic operations described in the previous 
section. You'll see that you can add and subtract complex numbers 
represented by points in the complex plane in a similar way to how you 
add and subtract vectors. 


2.1 The complex plane 


You saw in Unit 1 that the real numbers can be represented as points on a 
line, called the real line or the number line, as shown in Figure 4. 


Figure 4 Part of the real line 


Since a complex number is made up of two real numbers, its real part and 
its imaginary part, it can be represented by a point in a plane. 


The complex plane 


The complex plane is a plane in which the complex number a + bi 
is represented by the point (a,b). 


The horizontal axis is called the real axis and the vertical axis is 
called the imaginary axis. 


For example, the complex number 3 + 4i is represented in the complex 
plane by the point (3,4). This complex number is shown, with others, in 
Figure 5. Notice that the origin represents the complex number 0. 


imaginary axis 


real axis 


© 
—l1—32 


Figure 5 Complex numbers in the complex plane 
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The grid in Figure 5 is a unit grid, which means that adjacent horizontal 
lines in the grid are separated by one unit, and adjacent vertical lines are 
also separated by one unit. All the grids that you’ll see in this unit are 
unit grids. 


Note that for simplicity we usually don’t distinguish between complex 
numbers and the points that represent them in the complex plane. For 
example, we say that the complex number i lies on the imaginary axis, 
rather than that the point representing the complex number 7 lies on the 
imaginary axis. 


Activity 13 Marking complex numbers in the complex plane 


Mark the following complex numbers on a diagram of the complex plane: 


2-4, —-34+2), 3, —i and 4 


The complex plane is also known as the Argand diagram, after a 
French mathematician with the surname Argand, who in 1806 wrote 
an essay on representing complex numbers geometrically in a plane. 


Recent research has shown that reliable biographical information 
about Argand is extremely limited; not even his first name is known! 
There is no evidence that he was a certain Swiss-born man called 
Jean-Robert Argand, as was previously believed. Information in one 
of Argand’s publications suggests that he was a scientifically oriented 
technician, based in the Parisian clock-making industry. 


The idea of introducing a complex plane had been proposed before, by 
the English mathematician John Wallis (1616-1703), and separately 
by the Norwegian—Danish mathematician Caspar Wessel (1745-1818), 
but these earlier proposals failed to gain popular acceptance. 


Let’s consider what happens in the complex plane when you add two 
complex numbers. By the usual rule for adding complex numbers, 


(a+ bi) + (c+ di) =(a+c)+(b+d)i. 


So if you add the complex numbers represented by the points (a,b) and 
(c,d), then you get the complex number represented by the point 
(a+c,b+d). That is, you just add the coordinates separately, in the same 
way that you add the components separately when you add vectors. 


Because of this, you can add two complex numbers in the complex plane 
by drawing (or imagining) a parallelogram, as illustrated in Figure 6. This 
is the parallelogram law for the addition of complex numbers, which is 
similar to the parallelogram law for vector addition. 
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Figure 6 The parallelogram law for the addition of complex numbers 


Next let’s consider what happens in the complex plane when you multiply 
a complex number by a real number. We have 


m(a + bi) = ma + mbi. 


So if you multiply the complex number represented by the point (a,b) by 
the real number m, then you get the complex number represented by the 
point (ma,mb). That is, you just multiply the coordinates separately by 
the real number, in the same way that you multiply the components 
separately when you multiply a vector by a scalar. 


The effect of this is that when you multiply a complex number z by a real 
number m, the resulting complex number mz remains on the line that 
passes through the origin and z, but is |m| times as far from the origin as z 
is. If m is positive, then mz lies on the same side of the origin as z, and if 
m is negative, then mz lies on the opposite side of the origin from z. These 
effects are illustrated in Figure 7. They’re similar to the effects of 
multiplying a vector by a scalar. 


(a) (b) 


Figure 7 (a) Multiplication by 2 (b) Multiplication by —1 
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Complex conjugation also has a simple geometric interpretation in the 
complex plane: the number Z = a — bi is the image of z = a + bi under 
reflection in the real axis, as shown in Figure 8. 


Nl 


Figure 8 Complex conjugation in the complex plane 


Activity 14 Adding complex numbers, multiplying complex numbers 
by real numbers, and complex conjugation in the complex plane 


On a copy (or several copies) of the diagram below, mark the complex 
numbers in parts (a)—(f). Do this by thinking geometrically, in the ways 
described above, rather than by first writing z and w in the form a + bi 
and using the algebraic methods from Section 1. 


(a) z+w (b) 2z (c) Z (d) z+z (e) —w (£) z- w 
(Remember that z — w = z + (—w).) 


You can check your answers to Activity 14 by writing z and w in the form 
a+ bi and using the methods from Section 1. You can see from the 
diagram that z = 2 + i and w = 1 + 3i. Therefore, for example, 

z+w = (2+i)+(1+3i)=(2+1)+ (1+3) =3+ 4i. 


If you plot this answer 3 + 4i in the complex plane, as shown in Figure 9, 
then you obtain the same point that you found in Activity 14(a) by using 
the parallelogram law. 


193 


Unit 12 Complex numbers 


194 


z+w=34+4i 
w = 1 +i3i 


Figure 9 Adding 2 + i and 1 + 3i in the complex plane 


2.2 Modulus of a complex number 


You saw in Section 2 of Unit 2 that the modulus of a real number x, which 
is denoted by |x| and is also called the magnitude or absolute value of x, is 
the distance of x from zero. For example, |3| = 3 and | — 3| = 3. 
Essentially, the modulus of a real number zx is the ‘size’ of x. 


In the same way, the modulus (or absolute value) of a complex 

number z, which is denoted by |z|, is its distance from the origin, and is a 
measure of its size. (We don’t use the word ‘magnitude’ for complex 
numbers.) Pythagoras’ theorem gives the formula below for the modulus of 
a complex number, which is illustrated in Figure 10. 


Modulus of a complex number 


Let z = a + bi. The modulus |z| of z is its distance from the origin, 
given by 


|z| = Va? + 0?. 


The plural of ‘modulus’ is moduli. 


z=a+bi 


|z| = Va? + b2 


Figure 10 The distance between the origin and z = a + bi 


is |z| = Va? + b?. 


2 Geometry with complex numbers 


Let’s calculate the moduli of some particular complex numbers. For 
example, 


|3 — 4il = \/32 + = V9 F 16 = V25 =5 
and 
|= 7| = V(-7)? = v49 =7. 


As illustrated by the second example, if z is a real number, then |z| is just 
the usual modulus of this real number. 


Activity 15 Finding the moduli of complex numbers 


Find the moduli of the following complex numbers. 
(a) 3+ 4i (b) —4+4 3i (c) 0 (d) —31 (e) 17i 
G7- eji Wie Ga () —3 + BA 


In Subsection 1.4 you saw that, for any complex number z = a + bi, 
2B =a +0. 

The expression a? + b? is equal to |z|?, so we have the useful identity 
2% = |z’. 

We can obtain another useful identity by replacing z by zw in the 

identity |z|? = zZ, to give 

zw|? = zwzw. 

You learned earlier that ZW = ZW, so 

zw|? = zw2w = zwzW = 2zww = |z|?|w)?. 


Hence |zw|? = |z|?|w|?, and taking square roots gives the useful identity 


zw| = |z||w|. 


From this identity you can deduce a third useful identity, as follows. If 
w #0, then 


zZ 


w 


_ kl 


w| 


a= lixu- lli © 


Properties of modulus 


z = lF 
|zw| = ie 
F |=; E = 
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Activity 16 Proving identities involving modulus 


By writing z = a+ bi, show that |—z| = |z| and |Z| = |z|. 


If you divide both sides of the identity zz = |z|? by Z, then you obtain 


_ lel? 

= 
Taking the reciprocal of both sides of this equation gives you a neat 
formula for finding 1/z, as follows. 


For example, the reciprocal of 1 + 3i is 


1 1+ 32 1— 32 1— 32 


143i [1+3 12432 10 


Activity 17 Working out the reciprocals of complex numbers 


Find the reciprocals of the following complex numbers. 
(a) 2+i (b) -1 — 3i (c) 2i 


2.3 Argument of a complex number 


You can completely specify a particular complex number by stating its 
modulus, that is, its distance from the origin in the complex plane, 
together with the direction in which it lies from the origin. This direction 
is usually specified by stating a particular angle associated with the 
complex number, which we call an argument of the complex number. 
You'll learn about the arguments of a complex number in this subsection. 


Together, the modulus and an argument of a complex number can be used 


to express the complex number in a helpful form, called polar form, which 
you'll meet in the next subsection. 


Arguments of a complex number 


An argument of a non-zero complex number z is an angle in radians 
measured anticlockwise from the positive real axis to the line between 
the origin and z. 


The number 0 doesn’t have an argument. 


2 Geometry with complex numbers 


For example, one argument of the complex number —4 + 4i is 37/4, as 
shown in Figure 11(a). You can see that this angle is 37/4 because it’s 
three-quarters of a half-turn, and a half-turn is 7 radians. Other 
arguments of the same complex number are —57/4 and 117/4, as shown in 
Figures 11(b) and (c). The angle —57/4 is negative because it’s measured 
clockwise, rather than anticlockwise. 


—A+ 4t— 4 44444 —-44+41-, 
= Ln 
4 
Be > > NN > 
ND SP 
TA 


(a) (b) (c) 


Figure 11 Three different arguments of —4 + 4i 


You’ve encountered this sort of situation before, in which the same 
direction is specified by different angles. For example, it occurred when 
you worked with the directions of vectors in Section 6 of Unit 5. The 
angles —57/4 and 117/4 each have the same rotational effect as 37/4, 
because they each differ from 37/4 by an integer multiple of 27: 

om ƏT lir 3a 


i PE and aC fro 


In fact, you can see that each non-zero complex number has infinitely 
many arguments, because you can add any integer multiple of 27 to an 
argument to obtain another argument. Exactly one of these arguments lies 
in the interval (—z, 7], and it’s often the simplest one to use. 


Principal argument of a complex number 


The principal argument of a non-zero complex number z is the 
argument of z that lies in the interval (—7,7]. This angle is denoted 
by Arg(z). 


For example, the principal argument of z = —4 + 4i is 37/4. That is, 


3 
Arg(—4 + 4i) = Z. 


In some other texts, the principal argument is called the principal value 
of the argument, and sometimes the interval [0, 27) is used rather 

than (—7, 7]. 

Let’s now consider how to find the principal argument of a complex 
number. If the complex number lies on one of the axes, then you can find 


the principal argument just from a sketch, as the next example 
demonstrates. 
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Activity 18 Finding the principal argument of a complex number 
that lies on one of the axes 


Find the principal argument of each of the following complex numbers. 
(a) Zi (b) —4i (à -3 (dd) 2 


The method for finding the principal argument of a complex number that 
doesn’t lie on one of the axes is similar to the method for finding the 
direction of a two-dimensional vector from its components, which you met 
in Unit 5. It involves finding values of inverse tangent, so you may find it 
helpful to refer to Table 1, which contains the tangents of special angles 
that you met in Unit 4. 


Table 1 Tangents of special angles 


0 


in radians vane 

T 1 

6 V3 
T 

— 1 

4 

T 

— 3 
V3 
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Example 5 Finding the principal argument of a complex number A 
that doesn’t lie on one of the axes — 


Find the principal argument of the complex number —1 + iV3. 
Solution 


@. Sketch —1 + i/3 in the complex plane. The important thing is to 
get it in the correct quadrant. Label the principal argument 0, and 
label by ¢@ the acute angle between the real axis and the line from the 
origin to z. & 


2=-1+iv3 


@. Draw a line from z to the real axis that is perpendicular to the 
real axis to form a right-angled triangle. Mark the lengths of the 
horizontal and vertical sides of the triangle. ® 


z=-1+iv3 
v3 6 


— 


@. Use the triangle to work out the acute angle ¢, and hence work 
out the principal argument 0. © 


From the diagram, 


tang = Y= V3 


Therefore ¢ = 7/3. So the principal argument is 


T 2m 
d= — = — — = —, 
T—Q=T 3 3 
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Activity 19 Finding the principal arguments of complex numbers 
that don't lie on one of the axes 


Find the principal argument of each of the following complex numbers. 


(a) 1+i (b) 1-#/3 (eo) -V3-i (d) 2V3 - 2 


What's arg (z)? 


You're both wrong! 
ts 110 


Possible origin of the term ‘argument’ 


2.4 Polar form 


All the complex numbers that you’ve met so far have been written in the 
form a + bi, for real numbers a and b. This form is known as the 
Cartesian form of a complex number. In this subsection you’ll learn 
about an alternative way to write complex numbers, using the modulus 
and argument, known as polar form. In fact, the idea is not entirely new to 
you, as you met a similar procedure in Unit 5 when you saw how to find 
the components of a vector from its magnitude and direction. 


To find the polar form of a non-zero complex number z, we begin with the 
Cartesian form z = a + bi, and express a and b in terms of the modulus 
and one of the arguments of z. Let’s write r for the modulus |z|, and 0 for 
one of the arguments, as shown in Figure 12. In this case 0 happens to be 
the principal argument of z, but the discussion that follows is valid no 
matter what the choice of argument. 
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Figure 12 A complex number z = a + bi with modulus r and argument 6 


To see how to write a and b in terms of r and 0, first consider the complex 
number w that has the same argument as z, namely 6, but whose modulus 
is 1, as shown in Figure 13(a). This complex number w lies on the unit 
circle, and hence it follows from what you saw in Unit 4, Subsection 2.2, 
that its coordinates are (cos 8, sin 0). This is true whatever the size of the 


argument 0. 


ow A. 
oe “es 


(a) (b) 


Figure 13 The complex numbers (a) w, with modulus 1 and argument 6 
(b) z, with modulus r and argument 0 


In other words, 
w = cos@ + isin ð. 

Since z = rw, as illustrated in Figure 13(b), it follows that 
z = r(cos0 + isin 0). 


This is the polar form of z, in which z is written in terms of its modulus 
and one of its arguments. 


Polar form of a complex number 
A non-zero complex number z is in polar form if it is expressed as 
z = r(cos0 + isin 0). 


Here r is the modulus of z, and 0 is an argument of z. 
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For example, the complex number z with modulus 2 and argument 37/4, 
which is shown in Figure 14, can be written in polar form as 


s-2(o=(%) +a (%)) 


Figure 14 The complex number z with modulus 2 and argument 37/4 


As you saw in the previous subsection, any angle that differs from 37/4 by 
an integer multiple of 27 is also an argument of the complex number z in 
Figure 14. For instance, —57/4 and 117/4 are also arguments of z, so you 
can alternatively write z in polar form as 


(e(a) 


or 


_9 lla pia lla 
z= COS Z sın Za š 


However, unless there’s a good reason not to do so, you should use the 
principal argument when writing a complex number in polar form, which 
in this case is our original argument 37/4. 


The number 0 doesn’t have an argument, so it doesn’t have a polar form 
either. If you want to write the number zero, then you should just use the 
symbol 0, whether you’re working with Cartesian or polar forms. 


Let’s now consider how to convert between the Cartesian and polar forms 
of a complex number. To convert a complex number from polar form to 
Cartesian form you need to work out values of sine and cosine. You can 
use your calculator to do this, but you’ll develop a better understanding of 
the procedure if instead you use techniques from Unit 4 for working out 
sines and cosines of simple fractions of 7, and refer to Table 2, which 
contains the sines and cosines of special angles from Section 1 of Unit 4. 
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Table 2 Sines and cosines of special angles 


; 4 i sin 0 cos 0 
in radians 
0 0 1 
z TE 
6 2 2 
T 1 1 
1 a | VB 
T v3 1 
3 2 2 
T 
— 1 
5 0 


1) 


Activity 20 Converting from polar form to Cartesian form 


Write the following complex numbers in Cartesian form. 
ae T car T 

(a) 3(cos0 + isin 0) (b) 7 (cos (5) +isin (Z) 
ou T an T 

(c) 6(cos a + isin 7) (d) cos (-3) +isin (-3) 


T 


(e) 5(cos(—m) + isin(—7)) (f) 4 (cos (=) Pusha (5)) 


(8) 2 (cos (-7) +isin (-7)) (h) V3 (cos E eer (=)) 


To convert a complex number from Cartesian form to polar form, you need 
to calculate its modulus and principal argument. 
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Cc) Example 7 Converting from Cartesian form to polar form 
Write the complex number —2 — 2v3 i in polar form. 
Solution 

@. First find the modulus. © 


The modulus is 
2 
r = \/(-2)?+ (-2v3) = VIF 2 = VIG =4. 


@. To find the principal argument, sketch z = —2 — 2\/3i in the 
complex plane. The important thing is to get it in the correct 
quadrant. Label the principal argument 0, and label by ¢ the acute 
angle between the real axis and the line from the origin to z. ® 


A 


ie 
0 
Z= SS Ga 


@. Draw a line from z to the real axis that is perpendicular to the 
real axis, to form a right-angled triangle. Mark the lengths of the 
horizontal and vertical sides of the triangle. © 


A 


LA 


24/3 
2S N DA: 
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When you convert a complex number from Cartesian form to polar form, 
you can readily check your answer by converting back. For example, for 
the complex number in Example 7, 


z=4 (cos (-7) +isin (-7)) = (-; -:£) = -2-2V/3i. 


You can use your solutions to Activities 18 and 19 to help you with the 
next activity. 


Activity 21 Converting from Cartesian form to polar form 


Write the following complex numbers in polar form. 
(a) fi (b) -4 (o -3 (@) 2) 1+i (f) 1-iv3 
(g) -v3-i (h) 2v3-2i 


2.5 Multiplication and division in polar form 


The Cartesian form of a complex number is convenient for adding and 
subtracting complex numbers, because to add or subtract complex 
numbers you just add or subtract their real and imaginary parts 
separately. In this subsection you’ll see that polar form is more suited for 
multiplication and division. 


Consider two complex numbers in polar form, z = r(cos 0 + isin 0) 
and w = s(cos¢+isin¢). Then 


zw =r(cos@+isin@) x s(cos ġ + isin ¢) 
= rs(cos @cos $ + i cos Osin ¢ + isin 6 cos ¢ + i? sin sin ¢) 


= rs((cos 9 cos ¢ — sin @ sin $) + i(cos sin ¢ + sin 6 cos ¢)). 
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Recall the angle sum identities for sine and cosine, from Section 4 of Unit 4: 


sin(0 + ¢) = sin@cos ¢ + cos 0 sin o 
cos(@ + ¢) = cos 0 cos ¢ — sin @sin ¢. 


Using these identities we obtain 
zw = rs(cos(0 + ¢) + isin(6 + ¢)). 


This shows that to multiply two complex numbers in polar form, you 
multiply their moduli and add their arguments. 


Product of complex numbers in polar form 
Let z = r(cos 0 + isin 8) and w = s(cos¢+ising). Then 
zw = rs(cos(0 + d) + isin(0 + ¢)). 


Example 8 Finding the product of complex numbers in polar form 


Let 
go = iO (cos (=) +7sin (=)) 
nd 
_5 Ar Wee An 
w= cos 5 tsin 5 


Find zw in polar form. 
Solution 
@., Multiply the moduli and add the arguments. ® 


— 50 a oe ee 3a, An 
Bw = co 5 5 isin 5 5 
y T 
= 0) (cos (=) + isin (=) 


@. The argument 77/5 doesn’t lie in the interval (—r, 7], so it isn’t 
the principal argument of zw. To find the principal argument, 
subtract an integer multiple of 27 from 77/5 to obtain an angle that 
lies in the interval (—7, z]. ® 


Since 
71 w Me 37 
= -— oT z= — — eo — — — 


5 5 5? 
it follows that 


3 3 
zw = 0) (cos (-=) + isin (-=)) : 
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In the next activity, remember to use the principal argument in the polar 
form in your answers. 


Activity 22 Finding the product of complex numbers in polar form 


Find zw in polar form in each of the following cases. 


(a) 2=12 (cos (=) +isin 6) w=4 (cos (5) +isin (5)) 
T 7 3 3 
(b) 2=8 (cos (=) + isin (=)). w= 5 (cos (=) + isin (=)) 
(c) 87 age 87 87 bia 8T 
z= — in | — = — in | — 
c cos | —> is g p w= cos | > is 9 
You’ve seen that to find the product of two complex numbers in polar form 
you multiply their moduli and add their arguments. You do the same to 


find the product of three or more complex numbers in polar form. For 
example, the product of 


3 Ar a An 
U= cos E +ısın 5 ; 
v= 2 (cos ( =) isin ( =)) and 
5 5 
T 


is 


4n d T dár Qn T 
= 2 ee a iat oe 
uvw = (3 x 2 x 7) (o (2 +2) +isin (3 7 +2)) 


-o(a (8) e E) 


Activity 23 Finding the product of several complex numbers in 
polar form 


Find the product of the following three complex numbers in polar form: 


aa (f2) tisa (2) 
v = cos (=) + isin (=) 
w= (cu (22) +1 (®)) 
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Let’s now look at how to divide two complex numbers in polar form. As 
before, let z = r(cos 0 + isin 0) and w = s(cos ọ + isin ġ). At the end of 
Subsection 2.2, you saw a formula for the reciprocal of a non-zero complex 
number, which gives 

ji wW 

w fw? 
Remember that |w| = s. Multiplying both sides of the equation above by z 
gives 


zZz č zW 
w |w]? 
r(cos0 + isin 0) x s(cos ¢ — isin ¢) 


s2 


= —( cos 8 cos ¢ — icos @sin ¢ + isin 0 cos ¢ — i” sin sin œ) 
= L ((cos ø cos g + sin ô sin ¢) + i(sin 8 cos @ — cos 8 sin ¢)). 


Recall the angle difference identities for sine and cosine, from Section 4 of 
Unit 4: 

sin(@ — ¢) = sin f cos ¢ — cos 0 sin ġ 

cos(@ — ¢) = cos 0 cos ¢@ + sin Ô sin ¢. 
Using these identities we obtain 


= L (cos(0 — ¢) + isin(f — ¢)). 


This shows that to divide two complex numbers in polar form, you divide 
their moduli and subtract their arguments. 


Quotient of complex numbers in polar form 


Let z = r(cos 8 + isin 0) and w = s(cos ọ + isin ¢). Then 


Z 


-= — (cos(0 — $) + isin(0 — ¢)). 


2 Geometry with complex numbers 


Solution 
@. Divide the moduli and subtract the arguments. © 


a(o eG) 
(e(a) 


@®. The argument —67/5 doesn’t lie in the interval (—7, 7], so it isn’t 
the principal argument of z/w. To find the principal argument, add 
an integer multiple of 27 to —67/5 to obtain an angle that lies in the 
interval (—7, 7]. ® 


Since 
5 5 5 D 
it follows that 


Paqe) a) 


Activity 24 Finding the quotient of complex numbers in polar form 


Find z/w in polar form in each of the following cases. 


(a) 2=12 (cos (=) +isin 6) w=4 (cos (=) +isin (=)) 
OT 


Using the polar form of complex numbers you can now visualise in the 
complex plane how to multiply one complex number by another, as the 
following example demonstrates. 


Example 10 Visualising multiplication by 21 geometrically 
Describe the geometric effect of multiplying a complex number by 2i. 
Solution 

@. Start by finding the polar form of 2i. ® 
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From the diagram, 


2i = 2 (cos (5) + isin (5)) | 


@. Consider a general complex number z in polar form, to be 
multiplied by 2i. ® 


Now let z = r(cos@ + isin 0). 
@. Find the polar form of z x 2i. @ 
Then 
; a T TE 
z xX 2i = r(cos0 + isin) x 2 (cos (5) + isin (5)) 


= Dip (cos (0+7) +isin (@+=)). 


@. If you find it helpful, sketch z and z x 27 in the complex plane to 
help you understand how z x 2i is obtained from z geometrically. ® 


BOM 


Therefore, multiplying z by 22 corresponds to an anticlockwise 
rotation of z through a quarter turn (7/2 radians) and a scaling of z 
by the factor 2. 


Activity 25 Multiplying a complex number by —i 


Describe the geometric effect of multiplying a complex number by —i. 
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2.6 De Moivre’s formula 


You’ve seen how to multiply two or more complex numbers in polar form, 
so it’s now possible to consider powers of complex numbers in polar form. 
There’s a helpful formula for working out powers, known as de Moivre’s 
formula, named after the French mathematician Abraham de Moivre. 

(‘De Moivre’ is pronounced as ‘de mwa-vr’, where the ‘e’ is spoken like the 
‘u’ in ‘number’.) To obtain this formula, we apply the formulas for 
products and quotients of complex numbers found in the previous 
subsection, as follows. 


You saw that to find the product of two or more complex numbers in polar 
form, you multiply their moduli (to give the modulus of the product) and 
add their arguments (to give an argument of the product). Applying this 
procedure to n copies of the complex number z = r(cos@ + isin @) gives 


z2=rxrx-::xr(cos(6+0+---+6)+isn(O+O0+---+8)). 
SS N ee” N i 
multiply moduli add arguments add arguments 


That is, 


z” =r"(cosné + isin n0). 


This is de Moivre’s formula, for positive integers. In fact the formula is 
valid for all integers n, as stated below. 


De Moivre’s formula 
Let z = r(cos 0 + isin 8). Then, for any integer n, 


z” = r” (cosnð + isin n0). 


So far you’ve only seen why de Moivre’s formula is true for positive 
integers. When n = 0 it’s certainly true, because then both sides of the 
formula are equal to 1. To see why the formula is true for negative 
integers, begin with any positive integer n, and write the numbers 

1 and z” in polar form: 


1=cos0+isinO and 2z” =r"(cosné+isinné). 
Now apply the formula for quotients of complex numbers in polar form to 


the complex numbers 1 and z”. Dividing the moduli and subtracting the 
arguments gives 


1 


er (cos(0 — n@) + isin(0 — n@)) = r~"(cos(—n8) + isin(—n6)). 


gh pn 
Since z7” is the same as 1/z”, we obtain 
z "=r "(cos(—né) + isin(—né)). 


Because —n can represent any negative integer, this confirms that 
de Moivre’s formula is also true for negative integers. 
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Abraham de Moivre 
(1667-1754) 
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Arguably Abraham de Moivre’s greatest contributions to mathematics 
were in the theory of probability. In 1733, in a paper published in 
Latin, he was the first person to discuss the normal distribution, an 
important mathematical tool for modelling probabilities. In 1738 he 
included an English translation of the paper in the second edition of 
his book The Doctrine of Chances: A method of calculating the 
probabilities of events in play. It is said that de Moivre correctly 
predicted the day of his own death after observing that he was 
sleeping 15 minutes longer each night. Assuming that this would 
continue, he predicted that he would die on the day that he slept for 
24 hours, and did indeed die on that day, 27 November 1754. 


De Moivre’s formula allows you to find powers of complex numbers in 
polar form quickly. To find powers of complex numbers in Cartesian form, 
you can convert to polar form, find the power, then convert back to 
Cartesian form, as the following example demonstrates. 


Example 11 Working out powers of complex numbers 


Find (-v3 + i)” in Cartesian form. 
Solution 
@. Write —\/3 + i in polar form. To do this, first find the modulus. @ 


The modulus of — v3 + i is 
2 
r=\/(-v3) +12=v341=vi=2 


@. Then find the principal argument, by first sketching — v3 + i in 
the complex plane. ©& 


= 


IS 


From the diagram, 


tan dg = 


L 
5 


2 Geometry with complex numbers 


Therefore ¢ = 77/6. So 
x _ Sn 


Cae 
-arg 


Hence 


-v3 +i =2 (cos (=) + isin =)) 


@. Apply de Moivre’s formula. ® 


Therefore 


(af (um) f) 
Sees), 


@. Convert to Cartesian form. To help do this by hand, first rewrite 
the polar form using the principal argument. © 
Since 
207 a 
6 a 6’ 
we obtain 


(y elo +150) 
= ($ 4 3) 


= 16V3 + 16i. 


Activity 20 Working out powers of complex numbers 
Work out the Cartesian forms of the following complex numbers. 
n3 1/3 T O D a EEE 
(a) (+i) (b) (2 (cos (7) + isin (7) ) ) (c) (-1+iv3) 
(d) (VB+i)° (e) (2+2) 


In the final activity of this section you can learn how to work with complex 
numbers on the computer. 


| | 
Activity 27 Manipulating complex numbers with the CAS iN] 


Work through Subsection 13.1 of the Computer algebra guide. 
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3 Polynomial equations 


A polynomial equation is an equation of the form 
‘polynomial expression = 0’, where the polynomial expression has degree 
at least 1. For example, 


5a®°+8=0 and x’ — 132° +32?-2=0 


are polynomial equations, and every linear or quadratic equation is a 
polynomial equation. 


At the start of this unit you were told the remarkable fact that by 
introducing a solution 7 of the quadratic equation 
z?+1=0 


to form the complex numbers, you make it possible to find at least one 
solution of any quadratic equation. In this subsection, you’ll learn how to 
find all the solutions of any quadratic equation, including any solutions 
that are not real numbers. You’ll also learn about the fundamental theorem 
of algebra, which shows that in fact every polynomial equation has at least 
one solution in the set of complex numbers. 


3.1 Quadratic equations 


As you know, a quadratic equation is an equation of the 
form az? + bz + c = 0, where a Æ 0. For example, 


227 —5z—-1=0 and 2?—624+25=0 


are quadratic equations. We use the variable z here (rather than the more 
familiar x, say) because we want to work with complex numbers, and it’s 
traditional to use the letter z for a variable that represents a complex 
number. For the moment, let’s restrict our attention to quadratic 
equations in which the coefficients a, b and c are real numbers, and leave 
the possibility that a, b and c might be complex numbers that aren’t 
necessarily real to the end of this subsection. 


You saw in Section 4 of Unit 2 that the number of real solutions 
(solutions that are real numbers) of the quadratic equation 

az? + bz +c = 0 depends on the value of the discriminant b? — 4ac. The 
equation has 


e two real solutions if b? — 4ac > 0 

e one real solution if b? — 4ac = 0 

e no real solutions if b? — 4ac < 0. 

For example, the quadratic equation 2z? — 5z — 1 = 0 has discriminant 
(5)? — 4 x 2 x (—1) = 25 + 8 = 33, 


so it has two real solutions. In contrast, the quadratic 
equation 2% — 6z + 25 = 0 has discriminant 


(—6)? — 4 x 1 x 25 = 36 — 100 = —64, 
so it has no real solutions. 
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Although some quadratic equations have no real solutions, every quadratic 
equation has either one or two solutions that are complex numbers, which 
we call complex solutions. Complex solutions can be real numbers 
(because a real number is a special type of complex number), but they can 
also be complex numbers that are not real, like 1 +i or —71. 


One way to find the complex solutions of a quadratic equation is to use the 
method of completing the square, in much the same way as in Unit 2. The 
only difference is that now you have to allow square roots of negative 
numbers. 


Example 12 Solving a quadratic equation that has no real solutions A 
by completing the square — 


Solve the quadratic equation z? — 6z + 25 = 0 by completing the 
square. 


Solution 

Completing the square gives 
(z= 3)? +16 =0; 

that is, 
(2—3)? ==16. 


@. Take the square root of both sides. Remember from 
Subsection 1.3 that if d is a positive real number, then the square 
roots of —d are +iV/d. ® 


Therefore 
z—-3=+4ivi6; 
that is, 
z — 3 = +4i, 
so 
2 = dise dy, 


Remember that you can check the solutions of a quadratic equation by 
substituting them back into the equation. For the quadratic equation in 
Example 12, we have 
(3 + 4i)? — 6(3 + 42) + 25 = (3 + 4i) (3 + 42) — 6(3 + 4i) + 25 
= 9 + 12i + 12i + 167” — 18 — 24i + 25 
= (9 — 16 — 18 + 25) + i(12 + 12 — 24) 
= 0. 


You can check the other solution, 3 — 47, in the same way. 
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Activity 28 Solving quadratic equations that have no real solutions 
by completing the square 


Solve the following quadratic equations by completing the square. 
(a) 227-2z7+2=0 (b) 27+4z+13=0 (c) 24+25=0 


Another way to find the complex solutions of a quadratic equation is to 
use the usual quadratic formula. 


The quadratic formula 


The solutions of the quadratic equation az? + bz + c = 0, where a, b 
and c are real numbers, are given by 


eee —b+ Vb — 4ac 
7 Le 


As you saw earlier, the quadratic formula gives two real solutions of the 
quadratic equation if b? — 4ac > 0 and one real solution if b? — 4ac = 0. 


When b? — 4ac < 0, there’s a problem with the quadratic formula, because 
the square root sign vV cannot be used with a negative number inside it. 
For convenience, we make an exception here, and when d is a positive real 
number we allow the notation +,/—d to mean the two square roots of the 
negative number —d, which you learned in Subsection 1.3 are +i/d. This 
practice doesn’t cause errors, because the + symbol means that you'll 
always consider both square roots of a negative number at once. 


For example, if b? — 4ac = —16 then the quadratic formula involves the 
term 


+/—I6, 


which means the two square roots of —16, namely +42. 
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Solution 
@. Use the quadratic formula. © 


—10+ V102-—4x 1x 34 


2x 


—10=+ yA — 136 


2 
=I y/o 
2 


@. The term +y—36 means the two square roots of —36, 
namely 6i. ® 


_ -10 + 6: 


2 
= = ae oY 


Activity 29 Solving quadratic equations using the quadratic formula 
Solve the following quadratic equations by using the quadratic formula, or 
a simpler method if possible. 

(a) 22+2z+2=0 (b) 274+6z2+9=0 (c) 327+5=0 
(d) 2 —4z+8=0 (e) 22+3z=0 (f) 227-32+5=0 


You can see from the quadratic formula that when a, b and c are real 
numbers such that b? — 4ac < 0, the two complex solutions of the 
quadratic equation az? + bz + c = 0 are complex conjugates of each other. 
For example, the two solutions —5 + 32 and —5 — 3i of the quadratic 
equation z? + 10z + 34 = 0 in Example 13 are complex conjugates of each 
other. Two complex numbers that are complex conjugates of each other 
are together called a complex conjugate pair. 


The next example shows how you can choose any complex conjugate pair 
and then find a quadratic equation whose solutions are that complex 
conjugate pair. 


Example 14 Finding a quadratic equation with a given pair of 
solutions 


Find, in its simplest form, a quadratic equation that has solutions 
2AE. 


3 Polynomial equations 


217 


Unit 12 Complex numbers 


218 


Solution 


@. For any numbers u and v, a quadratic equation with solutions 
u and v is (z—u)(z—v) = 0. & 
A quadratic equation with solutions 2 +7 is 

(z — (2+7))(z — (2—7)) =0. 
@. To simplify this equation, expand the brackets. Do this directly, 
or, to do it slightly more efficiently, first write the expression 
(z — (2 + i))(z — (2 — i)) as a difference of two squares, by rewriting 
z — (2 +i) as (z — 2) — i and z — (2 — i) as (z — 2) + i. @ 
Simplifying gives 

((z—2)—i)\((z—2)+i)=0 
@. Apply the difference of two squares formula 
(A— B)(A+ B) = A? — B? with A = z — 2 and B = i. @ 

=D =r =0 

a cie E 

z r = 0, 


So a quadratic equation with solutions 2 Æ i is z? — 4z + 5 = 0. 


The procedure in Example 14 always gives a quadratic equation in which 
the coefficient of z? is 1. However, you can multiply the equation through 
by any non-zero number to give a quadratic equation that has the same 
solutions but in which the coefficient of z? is not 1. 


Activity 30 Finding quadratic equations with given pairs of solutions 


Find, in their simplest forms, quadratic equations with the following pairs 
of solutions. 


(a) 142i (b) -34i (c) +7 (d) 14+ 


NI—= 
>. 


So far all the quadratic equations az? + bz + c = 0 that you’ve met have 
had real coefficients a, b and c, and this is the only type of quadratic 
equation that you'll solve in this module. However, the quadratic formula 
also gives the solutions of quadratic equations in which a, b and c are 
complex numbers that aren’t necessarily real. For such an equation, the 
expression +/b? — 4ac represents the two square roots of a complex 
number. You’ll learn how to find square roots, and other roots, of complex 
numbers in the next subsection. 
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3.2 Roots of complex numbers 


As you know, if a is any number, then any solution x of the equation 
rL =a 

is called a square root of a. For example, the equation 
r =4 


has two solutions; in other words, the number 4 has two square roots, 
namely 2 and —2. Similarly, you’ve seen that the equation 


r = —9 


has two solutions; in other words, the number —9 has two square roots, 
namely 32 and —3i. 


Notice that in both the cases a = 4 and a = —9 the pairs of square roots 
lie symmetrically on opposite sides of the origin, as shown in Figure 15. 


3i 


—3i 


(a) (b) 


Figure 15 The square roots of (a) 4 (b) —9 


You can investigate this symmetry property of square roots further in the 
next activity. 


Activity 31 = Investigating the symmetry of square roots H 


Use the Square roots of real numbers applet to investigate the solutions of 
the equation z? — a = 0, that is, the square roots of a, as a varies. 


In general, any solution z of the equation 

P 
where a is a complex number and n is a positive integer, is called an 
nth root of a. (We say ‘square root’ rather than ‘2nd root’, and ‘cube 
root’ rather than ‘3rd root’, as we do for real numbers.) In this subsection 
you'll learn how to find all the solutions of equations of this form. You’ll 
see that, if a is non-zero, then there are exactly n solutions, positioned 
symmetrically around the origin. If a is 0, then there’s only one solution, 
namely 0. 
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Let’s start by looking at equations of the form z” = 1, such as z° = 1 
and z+ = 1. Solutions of equations of this form are called roots of unity; 
the word ‘unity’ refers to the number 1. More specifically, 


e the solutions of the equation z? = 1 are the square roots of unity, 
e the solutions of the equation z? = 1 are the cube roots of unity, 
e the solutions of the equation z+ = 1 are the fourth roots of unity, 


and so on. 


Let’s consider the cube roots of unity. You know that 1 is a cube root of 
unity, because 1° = 1. There are two other cube roots though, namely 


i (-1 +iv3) and i (-1 = iv3) f 


Yov’ll see how to find such roots shortly, but for now let’s check that these 
numbers really are cube roots of unity. To check the first one, let’s first 
find the square and then the cube of the expression inside the brackets: 


(-1+av3)" =1-2iv3 +? (v3) 


= 1— 2ivV3 -3 
= —2 — 2i V3, 


(-1+iv3) = (-1+iv3) (-1+1v3) 
= (-2-2iv3) (-1+ iv3) 
= 2~ 2iv3 + 2V3- 2 (v3) 
=246 
=8, 

Therefore 
< Anan 
2 O Boo go o 


So $(-1 + iv3) is indeed a cube root of unity. 


Activity 32 Checking a cube root of unity 


Check that 3(-1 — iv3) is a cube root of unity. 


The three cube roots of unity all have modulus 1, as you can check directly 
if you wish. To see why, suppose that z is a cube root of unity, so z? = 1. 
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Taking the modulus of both sides of this equation gives 
Z| =1. 

Now 
|= |z x zx z| = |z| x |z| x |z| = zl, 
so it follows that 


z| = 1. 


Therefore |z| = 1; that is, z has modulus 1. 


You can see that a similar argument will hold for any nth root of unity, for 
any positive integer n. So, for any positive integer n, the nth roots of unity 
all have modulus 1. 


The three cube roots of unity are shown in the complex plane in Figure 16. 
Because they all have modulus 1, they all lie on the unit circle, which is 
the circle of radius 1 that is centred on the origin. 


unit circle 


} (-1+ iv3) 


TEET) 


Figure 16 The cube roots of unity 


It appears from Figure 16 that the cube roots of unity are equally spaced 
around the unit circle, and you’ll see shortly that this is indeed so. More 
generally, you'll see that for any positive integer n, there are n solutions of 
the equation z” = 1 and they’re equally spaced around the unit circle. One 
of these solutions is the number 1, of course, which lies on the positive part 
of the real axis. 


The next example illustrates a method for finding the nth roots of unity, 
for any positive integer n, using the case n = 5 as an example. You'll see 
later in the subsection that you can use essentially the same method to 
find the nth roots of any complex number. 


You might find it particularly helpful to watch the tutorial clip for the 
following example. 


3 Polynomial equations 
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Example 15 Finding the fifth roots of unity 
Solve the equation z° = 1. Sketch the solutions in the complex plane. 
Solution 


@. Write the unknown z in polar form, in terms of an unknown 
modulus r and an unknown argument 9. Also write the number on 
the right-hand side of the equation in polar form. © 


Let z = r(cos 0 + isin 8). Also, 1 = cos 0 +isin0. So the equation is 
(r(cos@ + isin 0))* = cos0 + isin 0. 


@. Use de Moivre’s formula to find the polar form of the left-hand 
side. @ 


De Moivre’s formula gives 
r°(cos 56 + isin 50) = cos0 + isin 0. 


@. Find r by comparing the moduli of the two sides of this equation. 
The left-hand side has modulus r°, and the right-hand side has 
modulus 1. © 


Comparing moduli gives 
rř=1, so r=1. 


@. Now find 6 by comparing the arguments of the two sides of the 
equation. The left-hand side has argument 50, and the right-hand 
side has argument 0. However, it doesn’t follow that 50 = 0. Instead 
it follows that 50 = 0 + 2mz, for some integer m. So there are 
infinitely many possible values of 50, and hence infinitely many 
possible values of 0. ® 


Comparing arguments gives 
50 = 0 + 2mm = 2m7, where m is an integer. 


Hence 


2MT : : 
Q= Ta where m is an integer. 


@. Find the five values of 0 given by m = 0, 1, 2, 3, 4. (You’ll see 
after the example that other integers m don’t give further solutions. 
In general, if the exponent in the original equation is n, then you 
should find the values of 0 given by n consecutive integer values of m, 
starting with m = 0.) ® 


Taking m = 0, 1, 2, 3, 4 gives the following values of 0: 


2x 4n O e 


NUS a a tae 


@. Write out the solutions. It’s convenient to label them as zo, 21, 22, 
23, Z4. & 


The solutions are 


zo = cos0+2zsin0= 1 


(=) ee 
2 = GOs 5 +27s1n 


(=) i? 
zy = Cos 5 +2s1n 


BSS. FO. FP 
= Ss e|5 S15 
Ne ee Nee 


@. Sketch the solutions in the complex plane. Each solution has 
modulus 1, so they all lie on the unit circle. Plot the solution zp = 1, 
and then sketch 21, 22, z3 and z4, in that order, by marking regularly 
spaced points separated by the angle 27/5 anticlockwise around the 
unit circle. Just estimate each angle 27/5; there’s no need for a 
precise drawing. ® 


In the solution to Example 15 the fifth roots of unity were given in polar 
form, with arguments in the interval [0, 27r) rather than principal 
arguments (remember that principal arguments lie in the interval (—7, z]). 
In general when you’re finding roots of complex numbers, it’s usually more 
convenient to use arguments in the interval [0, 27) rather than principal 
arguments. 
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To help you see why values of m other than 0, 1, 2, 3 and 4 don’t give 
further solutions of the equation z5 = 1 in Example 15, let’s try some of 
these other values of m, to see what happens. 


For example, if you take m = 5, then you obtain the solution 


107 ues 107 
= — isin | — |Ì. 
z = CoS 5 5 


Since 107/5 = 2r = 27 + 0, this solution is the same as the solution zo 
found in the example. Similarly, if you take m = 6, then you obtain the 
solution 


127 aa 127 
= cos | — isin | — |]. 
Z= 5 5 


Since 1277/5 = 27 + 27/5, this solution is the same as the solution z1 found 
in the example. Once you’ve taken m to be 0, 1, 2, 3 and 4, the solutions 
that you obtain just start to repeat. The same thing happens if you take 
negative values of m. 


Here’s an algebraic explanation of this repeating behaviour. When you use 
the method in Example 15 to find the nth roots of unity for some value 
of n, each possible value of @ is of the form 


2MT . . 
0 = ——, where m is an integer. 
n 


Consider any integer m. It can be written in the form 
m= qn +r, 


where q is an integer and r is the remainder after m is divided by n. (For 

example, if n = 5, then the integer m = 14, for instance, can be written as 

14 = 2 x 5 + 4.) So the value of 6 corresponding to m can be written as 

2(qn +r)r 
n 


2rT 

g= =q X 2r + —, 
n 

where q is an integer, and r is one of the integers 0,1,...,n — 1. That is, 
the value of 0 corresponding to m differs from the value of 0 corresponding 
to one of the integers 0,1,...,n — 1 by an integer multiple of 27. Hence 
the solution z arising from the integer m is the same as the solution arising 
from one of the integers 0, 1,...,n — 1. 


The following box summarises the main steps of the method used in 
Example 15. The method is stated for a general equation z” = a, where a 
is any non-zero complex number, rather than just for equations of the form 
z” = 1, because it applies in this more general situation, as you’ll see later 
in this subsection. 


Strategy: 
To find the complex solutions of the equation z” = a, where 


(ge =10) 
1. Write the unknown z in polar form, in terms of an unknown 


modulus r and an unknown argument 0, and write the number a 
in polar form. 


2. Substitute the polar forms of z and a into the equation, and apply 
de Moivre’s formula to find the polar form of the left-hand side. 


3. Compare moduli to find the value of r. 
4. Compare arguments to find n successive possible values of 6. 
5. Hence write down the n possible values of z. 


It is usually convenient to use arguments in the interval [0, 27). 


The solutions found in Example 15 were left in polar form because in that 
form they’re exact, and it’s difficult to find the exact Cartesian form of all 
of them. Also, the polar form helps you to see that the solutions give five 
equally spaced points around the unit circle. The solution zọ was given in 
both polar form and Cartesian form. This is because its Cartesian form is 
so simple (zo = 1) that it would be strange not to mention this. 


In the next activity you’re asked to find roots of unity in both polar form 
and Cartesian form, because it’s fairly straightforward to give exact 
solutions in both forms. You saw the solutions of the equation in part (a) 
of this activity earlier in the subsection, but you should obtain them again, 
using the strategy. 


Activity 33 Finding roots of unity 


Solve the following equations. Give your answers in both polar form and 
Cartesian form. Sketch the solutions in the complex plane. 


(a) 2 =1 (b) 2=1 (c) 22 =1 


Now let’s look at equations of the form 
"=a; 


where a is any non-zero complex number. Before we use the strategy above 
to solve an equation like this, let’s look at a particular example of such an 
equation. 


Consider the equation 
2 = -8. 


It has one real solution, namely —2, because (—2)? = —8. It also has two 
other complex solutions, namely 1 + iV3 and 1 — iv3, which you can check 
yourself. All three solutions are shown in the complex plane in Figure 17. 


3 Polynomial equations 


225 


Unit 12 Complex numbers 


Figure 17 The cube roots of —8 


The three solutions don’t lie on the unit circle; instead they lie on the 
circle of radius 2 centred at the origin. To see why, suppose that z is any 
solution of the equation, so z? = —8. Taking the modulus of both sides of 
this equation gives 


jz7|=8, so |zP=8. 
Therefore 
|z| = 81/3 = 2. 


That is, z has modulus 2, and therefore lies on the circle of radius 2 
centred on the origin. 


It appears from Figure 17 that the three solutions are equally spaced 
around this circle, and indeed they are. In general, the following fact holds. 


For each non-zero number a, the equation z” = a has n complex 
solutions, and these are equally spaced around a circle centred on the 
origin. 


The next example illustrates how to apply the strategy that you’ve seen in 
this subsection to find all the complex solutions of an equation of the form 
z” =a, where a is non-zero. Again, you might find it helpful to watch the 
tutorial clip. 
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Let z = r(cos 0 + isin 0). The complex number 4i has modulus 4 and 
principal argument 7/2, so its polar form is 


4i = 4 (cos > + isin). 


So the equation is 


Seay ae eee 
(r(cos @ + isin 6)) =4 (cos = + isin Z). 


@. Use de Moivre’s formula to find the polar form of the left-hand 
side. @ 


De Moivre’s formula gives 
r'(cos 40 + isin 40) = 4 (cos Z +isin Z) : 


@. Find r by comparing the moduli of each side of the equation. ® 
Comparing moduli gives 
r+ =4, so r=4/4 = V3. 


@. Find 6 by comparing the arguments of the two sides of the 
equation. & 


Comparing arguments gives 


46 = - + 2m, where m is an integer. 


Therefore 
T mq : : 
= 3 + F where m is an integer. 


@. Find the values of 0 given by m = 0, 1, 2, 3. ® 


The values of 0 for m = 0, 1, 2, 3 are 

mo om Dar 13a 

aan ae Sere 
@. Write out the solutions. It’s convenient to label them as zo, 21, 22 
and 23. & 


The solutions are 


æ= V2 (cos (Z) + isin (Z)) 
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@. Sketch the solutions in the complex plane. Each solution has 
modulus vo. so they all lie on the circle of radius /2 centred at the 
origin. Sketch the solution zp, and then sketch z1, z2 and zg, in that 
order, by marking regularly spaced points separated by angles 7/2 
anticlockwise around the circle. ® 


As expected, the solutions of the equation z* = 4i in Example 16 are 
equally spaced round a circle centred on the origin. Note that the infinitely 
many possible values of the argument, 0 = 7/8 + m7/2, where m is an 
integer, give repeating solutions of the equation z4 = 4i in a similar way to 
the solutions of z” = 1 described earlier. 


Activity 34 Finding roots of complex numbers 


Solve the following equations. Give the solutions to parts (a) and (b) in 
both polar form and Cartesian form, and give the solutions to part (c) in 
polar form only. Sketch the solutions in the complex plane. 


(a) 2ê = 64 (b) 22 = —8 (c) 2 =-1-i 


3.3 The fundamental theorem of algebra 


As you learned earlier, a polynomial equation is an equation of the form 
‘polynomial expression = 0’, where the polynomial expression has degree 
at least 1. The coefficients in the polynomial expression can be complex 
numbers. For example, the following equations are all polynomial 
equations: 


2+1+i=0, 322+ 42-18=0 and 2 -(174+3i)z+2=0. 


The first of these equations can be rearranged as z% = —1 — i, which is an 
equation of the type that you learned how to solve in the previous 
subsection. The second equation is a quadratic equation. Cubic 
equations (such as the third equation above) and quartic equations are 
polynomial equations in which the polynomial expressions have degrees 3 
and 4, respectively. This subsection is about the fundamental theorem of 
algebra, which shows that every polynomial equation has at least one 
solution. 


Before you meet the fundamental theorem of algebra, let’s first consider an 
example that will help you to understand it. In Example 12 on page 215 
you saw that the solutions of the quadratic equation z? — 6z + 25 = 0 are 
3 + 4i and 3 — 4i. It follows that you can factorise the quadratic 

z? — 6z + 25 as 


2? — 6z + 25 = (z — (3 + 4i)) (z — (3 — 4i)). 


More generally, if z1 and z2 are the solutions of the quadratic 
equation az? + bz + c = 0 (there may be only one solution, in which 
case z1 = 22), then you can factorise the quadratic az? + bz + c as 


az? +bz +c = a(z — z1)(z — 22). 


The fundamental theorem of algebra says that it’s not only quadratics that 
can be factorised like this, but in fact any polynomial expression can be 
factorised in a similar way. 


The fundamental theorem of algebra 
Every polynomial 

On” A Opa A ooo E a E 
of degree n > 1 has a factorisation 

an(z — z1)(z — 22)--- (Z — 2n), 


where 21, 22, ..., Zn are complex numbers, some of which may be 
equal to others. 


The fundamental theorem of algebra was first proved by 

Carl Friedrich Gauss, and a proof was also given by Argand, who was 
mentioned in Subsection 2.1. The proof requires techniques more advanced 
than those in this module and is given in higher-level modules. 


The fundamental theorem of algebra tells you that any polynomial 
equation, say 


anz? dae” +--+ aiz + ag = 0, 
can be written in the form 


an(z — z1)(z — z2): +- (z — zn) = 0. 


3 Polynomial equations 
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It follows that the polynomial equation has solutions 21, z2,..., Zn, some of 
which may be equal to others. In particular, every polynomial equation has 
at least one complex solution. 


Although the fundamental theorem of algebra tells you that every 
polynomial equation has at least one complex solution, it doesn’t tell you 
how to find any solutions. Often finding solutions is difficult, and you have 
to use a computer. You can learn how to do that in the next activity. 


Activity 35 Solving polynomial equations with the CAS 


Work through Subsection 13.2 of the Computer algebra guide. 


Insolvability of the quintic 


Not only is there a formula for the solutions of a quadratic equation in 
terms of its coefficients, but there are also formulas for the solutions 
of cubic equations and quartic equations in terms of their coefficients. 
These formulas are long, and difficult to evaluate without a computer. 
For polynomial equations of higher degree, however, it’s impossible to 
find formulas involving the usual arithmetic operations that give all 
the solutions of all the equations. For example, the quintic equation 


2 +32+6=0 


is said to be insolvable because, although it has five solutions, it’s 
impossible to obtain these solutions from the coefficients 1, 3 and 6 by 
using only the operations +, —, x, + and vy. 


One of the first mathematicians to develop the theory of insolvable 
polynomial equations was the French Republican Evariste Galois 
(‘Galois’ is pronounced ‘Gal-wah’). Galois’ discoveries led to the 
development of the subject now known as Galois theory, in which the 
solutions of polynomial equations are studied systematically. Sadly, 
Galois didn’t receive full recognition for his work in his lifetime, 
because he died aged only twenty, as a result of a duel. 


Evariste Galois (1811-1832) 


4 Exponential form 


In this section you'll learn about a more concise version of polar form, 
called exponential form. You'll see that when you write complex numbers 
in exponential form, some of their properties, such as the formula for 
multiplication and de Moivre’s formula, become more intuitive. 
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4.1 Euler’s formula 


To find the exponential form of a complex number from its polar form, you 
use a formula called Euler’s formula. This formula gives a meaning for the 
expression e’”, where @ is a real number. So far you’ve learned what the 
expression e” means only when z is a real number, but you can get some 
idea of what the natural meaning of e? might be by considering Taylor 
series. Recall from Unit 11 that the Taylor series about 0 for e” is 
oo go mt o gð gê m 

=le tatata tatt 
Let’s try substituting i0 for x in this series. Since you’ve learned about 
Taylor series only for real numbers, you haven’t been shown that you’re 
allowed to do this (in fact you are: it’s justified in more advanced 
modules). Nonetheless, you’ll see that the substitution can help you 
understand why Euler’s formula makes sense. We obtain 


i ca (0P G0 GD”. OB). GO). Cay" 
eee ee, 3! 4! 5! 6! 7! 
' pa {202 n 7393 i464 i505 {668 i760" 
=1+i 


a a a” a a 
To simplify this expression, notice the following pattern: 
P=1, isi, ®=-1, P=-i, *#=1, P =i,.. 
This pattern 1,i,-1,-i repeats. 


(You may remember this pattern from Activity 6.) It follows that 
2 3 gt 05 6 g7 


7 82 6 a A P 6 A 
=e or a a Pe a ey 


You also saw in Unit 11 that the Taylor series about 0 for cos @ and sin @ are 


ae gg É E Ë g 
Sl a va a and sin? =t- ataa S 
Therefore 
e? = cos + isin#. 


This equation is Euler’s formula. It’s named after its discoverer, the 
Swiss mathematician Leonard Euler (1707-1783), who was mentioned in 
Unit 3. Euler obtained the formula in much the way that we have. 


The manipulation above shows that if the Taylor series about 0 for e” is 
valid with ¿0 instead of x, then Euler’s formula must hold. So Euler’s 
formula seems to be the natural definition of e’”, and hence we use it to 
define e. 


4 Exponential form 
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Euler’s formula 


Let 6 be a real number. Then e” is defined by Euler’s formula 


e”? = cos6 + isin 0. 


When 0 = 7, Euler’s formula says that eT = cost + isin 7; that is, 
e7 = —1. This equation can be rearranged to give the equation below, 


which is known as Euler’s equation. 


Euler’s equation 
e™+1=0 


This equation, which is also sometimes called Euler’s identity, is one of the 
most famous equations in mathematics, because it relates the five 
fundamental numbers 0, 1, 7, e and 7. 


Euler’s formula e’? = cos@ + isin @ tells us that a complex number z in 
polar form 


z = r(cos0 + isin 0) 


can be written in a more concise way as 
z= re. 

This is the exponential form of a complex number, promised at the start of 

this subsection. 


Exponential form of a complex number 


A non-zero complex number z is in exponential form if it is 
expressed as 
z=r, 


where r is the modulus of z and @ is an argument of z. 


For example, because the complex number 3i has modulus 3 and principal 
argument 7/2, it has exponential form 3e'*/2. It also has other exponential 
forms, corresponding to different choices of argument of 3i, such as 3e°%/2 
and 3e—87'/2. However, you should use the principal argument when 
writing a complex number in exponential form, unless there’s a good 
reason not to do so. 


The number 0 doesn’t have a polar form, as you’ve seen, and it doesn’t 
have an exponential form either. When working with complex numbers in 
exponential form, you should write the number zero using the usual 
symbol 0. 


Because the exponential form of a complex number is just a concise way to 
write its polar form, the process of converting between the Cartesian and 
exponential forms of a complex number is much the same as the process of 
converting between its Cartesian and polar forms, which you learned in 
Section 2. 


Example 17 Converting from exponential form to Cartesian form 


Write the complex number 4e~°”/® in Cartesian form. 


Solution 


@. Write the number in polar form, evaluate the cosine and sine, and 
multiply out the brackets. ® 


Ae) Oa (cos (-=) +isin (-=)) 
TER 


You can use your solutions to Activity 20 on page 203 to help you with the 
next activity. 


Activity 30 Converting from exponential form to Cartesian form 
Write the following complex numbers in Cartesian form. 
(a) 3e? (b) rer (c) 6e!™ (d) eur (e) 5e 

(£) 4eit/3 (g) Je7in/4 (h) 3027/6 


Example 18 Converting from Cartesian form to exponential form 
Write the complex number 1 — 7 in exponential form. 

Solution 

@. First find the modulus. @ 


The modulus is 


r= /@4 ip = viet = vi. 


®© 


4 Exponential form 


233 


Unit 12 Complex numbers 


You can use your solutions to Activity 21 on page 205 to help you with the 
next activity. 


Activity 37 Converting from Cartesian form to exponential form 


Write the following complex numbers in exponential form. 
(a) 75/2 (b) —4i (© -3 (d)2 (e) 1+i (£f) 1-iv3 
(g) -v3-i (b) 2V3 — 2% 


It’s often better to use the exponential form re’? of a complex number, 
rather than its polar form r(cos 0 + isin @), because it’s shorter. Another 
reason for preferring the exponential form is that some of the formulas 
involving polar form that you met earlier become more intuitive when 
complex numbers are expressed in exponential form. 


Consider, for example, the formula for the product of two complex 
numbers in polar form: 


r(cos@ + isin 0) x s(cos¢+isin¢) =rs(cos(@ + ¢) + isin(0 + ¢)). 
Ne a Ne — 
rer) sei? rsei(O+e) 


If you write the complex numbers in exponential form, then this formula 
becomes 


re” x se’? = rset +o), 


This version of the formula is more intuitive, and hence easier to remember 
and use, because it agrees with the usual index laws for multiplying powers 
of a real number. 
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In particular, when r = s = 1 you obtain the following equation, which 
looks like it follows from a familiar index law, but involves complex 
numbers. 


coi’ — pi(6+d) 


Similarly, de Moivre’s formula, 
(r(cos0 + isin 0))” = r"(cosn6 + isin nd), 

is simpler and more intuitive when written using exponential form: 
(r) = rre, 


Choosing r = 1 gives the rule below, which again looks like it follows from 
a familiar index law, but involves complex numbers. 


Activity 38 Writing a formula for the quotient of two complex 
numbers in exponential form 


The formula for the quotient of two complex numbers in polar form is 
r(cos @ + isin 0) 
s(cos ¢ + isin d) 


Write this formula using exponential form. 


= ~(cos(6 — ¢) +isin(@— @)). 


Activity 39 Working with complex numbers in exponential form 


Express each of the following products, quotients and powers of complex 
numbers as a single complex number in exponential form. 
e37i/8 et /8 


in mi iT 4 
(a) € /8 x e /8 (b) (e /8) (c) PZA d eani/8 


You can use Euler’s formula to obtain further useful formulas, by 
combining it with properties of sine and cosine. For example, if you replace 
0 by —@ in Euler’s formula then you obtain 


e™™® = cos(—6) + isin(—6). 
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You saw in Unit 4 that 


cos(—0) =cos@ and sin(—6@) = —sin@, 
which give the following formula for e~”’. 


e~” = cos — isin 0 


Since cos 0 — isin @ is the complex conjugate of cos 0 + isin 6, we can make 
the following useful observation. 


The complex conjugate of e” is e~”’. 


ið — cos — isin@ is used in the next subsection to obtain 
id 


The equation e7 i 
formulas for sin 6 and cos @ in terms of et? and e 


Activity 40 Proving an identity by using Euler's formula and 
properties of sine and cosine 


Using Euler’s formula, prove the identity 
eil0+2m) — oih, 


Complex impedance 


In an electrical circuit powered by a direct current (from a battery, 
for example), the current J, measured in amperes, and the voltage V, 
measured in volts, are related by Ohm’s law, which says that 


Vea 
where R is the resistance of the circuit, measured in ohms. 


The electric power in our homes is not supplied by a direct current. 
Instead it’s supplied by an alternating current, generated by an 
alternating voltage. The intensities of the alternating current J and 
voltage V oscillate with time, and are typically given by equations 
such as 


J=Josinwt and V =Vosin(ut + ¢), 


Most large wind turbines where Jo and Vo are the maximum values of the current 
generate an alternating and voltage, respectively, w determines the rate of oscillation 
current of both the current and the voltage, and ¢ measures how 


far the current and voltage are from being synchronised. 
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The graphs of these equations are illustrated in Figure 18. 
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(a) (b) 


Figure 18 The graphs of (a) J = Jo sin wt (b) V = Vosin(wt + ¢) 


Electrical engineers use complex numbers in exponential form to 
manipulate equations arising from alternating currents. They define 
the complex current J to equal Joe“ and the complex voltage V to 
equal Voe(t+%). It follows that 


Im(J) = Im(Joe’*) = Jo sin wt 
and 

Im(V) = Im(Voe"“"t®)) = Vo sin(wt + ¢). 
So the current J is the imaginary part of the complex current J and 
the voltage V is the imaginary part of the complex voltage V. The 
complex impedance Z is defined to be the ratio V/J. It can be shown 
that Z measures the various ways in which an electrical circuit resists 


the flow of an alternating current. By rearranging the equation 
Z = V/J you obtain a version of Ohm’s law for alternating currents: 


W = id). 


The actual alternating voltage can be found by comparing the 
imaginary parts of each side of this equation. 


In Unit 4 you saw some trigonometric identities, such as 
sin 28 = 2sinĝcosð and cos20 = 2cos? 8 — 1. 


In this final subsection yov’ll learn how you can use de Moivre’s formula 
and Euler’s formula, together with the binomial theorem from Unit 10, to 
obtain new trigonometric identities. This illustrates the fact that 
techniques involving complex numbers can often be used to deduce results 
about real functions. 


4 Exponential form 
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One way to find new trigonometric identities is to use the special case of 
de Moivre’s formula given in the box below. It’s obtained by taking the 

modulus r to be 1 in the general formula on page 211 and swapping the 

sides. 


Special case of de Moivre’s formula 


cos né + isin nð = (cos@ + isin 0)” 


Example 19 Using de Moivre's formula to find trigonometric 
identities 
Use de Moivre’s formula to obtain the following trigonometric 
identities: 
cos 30 = 4 cos? 6 — 3 cos 0 
sin 30 = 3sin 0 — 4sin® 0. 
Solution 
@. Use the special case of de Moivre’s formula. Because the required 
identities involve 30, take n = 3. ® 
By de Moivre’s formula, 
cos 30 + isin 30 = (cos@ + isin 0)’ 
@. Apply the binomial theorem. © 
= (cos 0)? + 3(cos 8)? (i sin 0) + 3(cos 0) (i sin 6)? 
+ (isin 6)? 
= cos? 6 + 3i cos? Ø sin 8 — 3cos@sin? 6 — i sin? 0 
= (cos? 6 — 3 cos ô sin? 0) + i(3 cos? ô sin 0 — sin? 6). 
@. To obtain the first identity, use the fact that the real part of the 


left-hand side is equal to the real part of the right-hand side. To 
express sin? 6 in terms of cos 0, use the identity sin? 0 + cos? 0 = 1. ® 


Therefore 
cos 30 = cos? 0 — 3 cos 0 sin? 0 
= cos? 8 — 3 cos 6(1 — cos? 0) 
= 4 cos? 0 — 3 cos 0 


®. To obtain the second identity, use the fact that the imaginary part 
of the left-hand side is equal to the imaginary part of the right-hand 
side. @& 


When you’ve obtained a new trigonometric identity, it’s worth trying it out 
with some specific values of the variable 0, to check that you haven’t made 
an error. Consider, for example, the identity obtained in Example 19: 


cos 30 = 4 cos? 0 — 3 cos 0. 


When 0 = 0, 
LHS =1 and RHS=4-3=1. 
When 6 = 7/3, 


LHS =-—1 and RHS = 4x ($)? -3x $= -1. 
When 6 = 7/2, both LHS and RHS are 0. 


So the identity certainly holds for these particular values of 0, as expected. 


Activity 41 Using de Moivre's formula to find trigonometric 
identities 


Use de Moivre’s formula to obtain the trigonometric identities 
cos 40 = 8 cos’ 0 — 8 cos? 0 + 1 
sin 40 = 4sin 8 cos 0(cos” 0 — sin? 0). 


To keep your working short, you may find it helpful to write c = cos 0 
and s = sin 0. 


Another way to obtain new trigonometric identities is to start by finding 
formulas for sin 6 and cos@ in terms of et? and e~”’, where @ is a real 
number. To do this, recall Euler’s formula and the complex conjugate form 
of Euler’s formula, obtained in the previous subsection: 

ec? 


e- = cos6 — isin @. 


= cos + isin 0 


Adding the two equations gives 


, , cif 4 e~i0 
e? +e = 2c0s6, so —7 = 608 0. 
Subtracting the bottom equation from the top equation gives 
cif — e~it 
e? — e = %sin 8, so = sin 0. 
i 


These are the formulas that we need. 
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e9 + e7 
cos = 
2 
i 10 e~t 
Simo = - 
2i 


The next example shows you how you can use the first of these formulas to 
obtain a trigonometric identity. (In fact this trigonometric identity is one 
of the identities from Example 19, in a rearranged form.) You also have to 
use the rules below, which you met in the previous subsection. 


® Example 20 Using the formula cos 0 = +(e’ +e") to obtain a 
—_ trigonometric identity 


Use the formula cos 0 = - (e? + eL) to obtain the identity 
cos? 0 = Í (cos 30 + 3 cos 6). 
Solution 


We have cos 0 = 4 (ce? + e~"*). Therefore 
lle res 
cos? 6 = z (e® + o) 
@®. Use the binomial theorem. ® 


il f ; 
= (Cay a Be ren” a Be“ (e 2)? at E) 
@. Simplify by using the rule (e*°)” = e®. @ 
_l o + 3e emi? 4 3e e720 4 T) 
8 
@. Simplify by using the rule e?et? = e+), @ 
_ z a + 36! + 3e? + ae, 


@. Rearrange to obtain expressions of the form G +e), and 
then use the formula given in the question again. ® 
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1 3i0 —3i0 10 —i0 
eG idea ae ara 
4 2 2 


il 
= (cos 30 + 3 cos 0). 


Activity 42 Using the formulas cos 0 = + (e® + e~*) 
and sin 9 = + (e” — e™™) to obtain trigonometric identities 


Use the formulas 
E f 1 . ‘ 
cos = = (e + > and sinb = — (e — e=) 
2 24 
to obtain the following identities. 


(a) sin? 0 = 4 (3sin 0 — sin 30) (b) cost 0 = $ (cos 48 + 4 cos 20 + 3) 


The examples and activities in this section have shown you how to use 
complex numbers to deduce new trigonometric identities that express 
sinnô and cos nô in terms of powers of sin 0 and cos 0, and vice versa. This 
should have given you some idea of the power of complex numbers, and an 
appreciation of the fact that their uses are far from ‘imaginary’. 


Quaternions 


You’ve seen that when you work with complex numbers, you’re 
performing arithmetic with pairs of real numbers. The Irish 
mathematician William Rowan Hamilton was the first to realise how 
to perform arithmetic with quadruples of real numbers. He did this, 
in a flash of inspiration, while walking beside the Royal Irish Canal 
near Dublin on 16 October 1843. His idea was to use numbers of the 
form a + bi + cj + dk, where a, b, c and d are real numbers, and 

P =p =k? =k = —1. 
Hamilton scratched this foundational equation on to Broom Bridge as 
he passed by. His scratches are no longer visible and instead a plaque 
records the event. Hamilton called his new system of numbers the 
quaternions. You can perform all the usual arithmetic operations 
with quaternions, with one crucial difference: multiplication is not 
commutative. For example, you can show that 


The plaque on Broom Bridge, 
Dublin, that marks 
Hamilton’s discovery 


ij =k, whereas ji =—k. 


Soon after the discovery of the quaternions, a system of arithmetic 
with octuples of real numbers was developed, called the octonions 
(or Cayley numbers). Multiplication of octonions is neither 
commutative nor associative. This makes them tricky to manipulate! 
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lm sorry, the 
number you have 
dialled is imaginary. 
Please rotate your phone 
through 90 degrees 
and redial. 
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Learning outcomes 


After studying this unit, you should be able to: 


e understand what complex numbers are, and carry out arithmetical 
operations on them 


e work with the complex plane 
e understand modulus and argument 


e understand the polar form of a complex number, and convert between 
Cartesian form and polar form 


e multiply and divide complex numbers in polar form 

e understand and use de Moivre’s formula 

e find all solutions of quadratic equations, including complex solutions 
e find roots of complex numbers 

e understand the fundamental theorem of algebra 

e state Euler’s formula 


e understand the exponential form of a complex number, and convert 
between this form and other forms 


e use de Moivre’s formula and Euler’s formula to obtain trigonometric 
identities. 


Closing remarks 


Closing remarks 


Well done for completing MST124 (or skipping to the last page)! You’ve 
covered a broad range of topics, which will provide you with the essential 
mathematical skills that you need to develop as an engineer, scientist, 
economist or mathematician. 


Let’s review some of the key topics that you met, and see how they fit 
together. You learned a good deal about functions, which are among the 
most fundamental objects in mathematics. The exponential function is of 
particular significance, not least because of its use in exponential models of 
real-life situations. The trigonometric functions are also of great 
importance, and their many practical applications include measuring 
distances and modelling waves. In this final unit you met Euler’s 
remarkable formula et? = cos 0 + isin 0, which brings together the 
exponential and trigonometric functions by using complex numbers. With compound interest, 
savings increase exponentially 


Another class of functions that you studied are the polynomial functions. 
You saw that you can approximate functions by Taylor polynomials, which 
is often extremely useful for understanding properties of functions and 
performing calculations with them. 


You saw that functions and sequences can be represented geometrically by 
their graphs. Graphs help to quickly communicate the key properties of 
functions and sequences, and they also provide a simple way to represent 
physical quantities such as velocity and acceleration. 


Through studying gradients of graphs you came to learn about 
differentiation, and then about integration, which is the reverse of 
differentiation. The theory surrounding these two concepts, which is 
known as calculus, is an essential part of almost every discipline that 
involves mathematics, from finance to medicine. You’ll make good use of 
calculus in higher-level modules involving mathematics. 


In Unit 5 you learned about another way of modelling real-life phenomena, 
namely by using vectors. Trigonometry is crucial when you’re working 
with vectors, because you can use it to calculate the directions of vectors. The acceleration of a diving 
You also learned some geometry in Unit 5, and then, in this final unit, you Peregrine falcon can be 
saw how by using the complex plane you can associate the rich structure of represented by a graph and 
the complex numbers with two-dimensional geometry. analysed using calculus 


Yet another central topic that you encountered is matrices, which have a 
huge range of applications, in modelling networks, solving linear equations, 
higher-dimensional calculus, and many other topics. 


The MST124 authors hope that you feel inspired to continue to study 
mathematics, whether it’s through engineering, science, statistics or any 
other subject with a mathematical component. 
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Solutions to activities 


Solution to Activity 1 


Solution to Activity 2 
(a) (2+ 5%) + (—7 + 13i) = (2+ (—7)) + (5 + 13): 


=—5 + 18i 
(2+ 5i) — (—7 + 13i) = (2 — (—7)) + (5 — 13)i 
=9-8i 
(b) (—42) + (—92) = ((—4) + (—9))i 
= —13i 
(—42) — (—92) = ((-4) — (-9))¢ 
=5i 
(c) (3—7i) + (3 — 7i) = (3 + 3) + ((-7) + (-7))i 
=6-14i 
(3 — Ti) — (3 — 7i) = (8-3) + ((-7) - (-7))i 
=f) 


(d) (3+ 7i) + (3— 7i) = (3 + 3) + (7 + (-7))i 
6 

(3 — 3) + (7 — (=7))i 
14i 


6+7- 6G- 


ii 
oO 
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(f) z+w = 1.2 + 3.4i 
z— w = 1.2 — 3.4i 
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Solution to Activity 3 

(a) u+ v +w = (4+ 6i) + (—3 + 5i) + (2 — i) 
= (4+ (—3) + 2) + (6+5 + (-1))i 
=3+102 

(b) By part (a), 

wtvtu=utu+w=34 10. 

(c) u-(v+w)=u-v—-—w 
= (4+ 6i) — (—3 + 5i) — (2 — i) 
= (4 — (—3) — 2) + (6 — 5 — (—1))i 
=5+21 

(d) u-(v—w) =u-v+w 
= (4+ 6i) — (—3 + 5i) + (2 — i) 
= (4 — (—3) + 2) + (6 — 5 + (—1))i 
=9 


Solution to Activity 4 
(a) (1 +3i)(2 + 4i) = 2 + 4i + 6i + 127? 
= 2 + 4i + 6i — 12 
= —10+ 10i 
(b) (—2 + 3i)(4 — 72) = —8 + 14i + 12i — 217? 
= —8 + 14i + 12i + 21 
=13 + 26i 
(c) 3i(4— 5i) = 12i — 151° 
= 12i +15 
=15+12i 
(d) 7(—2 + 5i) = —14 + 35% 
(e) (2 — 3i)(2 + 3i) = 4 + 6i — 6i — 9i? 
=4+6i—6i+9 
=13 
(f) G++) =$4414+i14+47 
=4+hi+i-3 


_ 5; 


Solution to Activity 5 
(a) u(v +w) = (14+ 22)((4 — 3i) + (—i)) 
= (1+ 2i)(4— 47) 
= 4 — 4i + 8i — 87” 
=4—4i+8i +8 
= 12 + 4i 
(b) By part (a), 
uv + uw = u(v + 
(c) wow = (1 + 2i)((4 — 3i)(~i)) 


(v +w) = 12 + 4i. 
( 

= (1 + 2i)(—4i + 37) 
(= 


= (1 + 2i)(—3 — 4i) 
= —3 — 4i — 6i — 8i? 
= —3 — 4i — 6i +8 
=5-— 10i 
Solution to Activity 6 

i? =1 

il =i 

il 

È = (i)i = (-1)i = -i 

i = (Ji = (—i)i = —i? = 1 

i = (iji = (Iji = i 


i6 = (i)i = (iJi = —1 


The powers of i are, in order, 


1, i,=1;, 0 1, =i, 1, i, =l; =i a 
D a a 


This pattern repeats 


Solution to Activity 7 
Since 
(3i)? = 3772 = 9 x (-1) = —9, 
it follows that 3i is a square root of —9. Since 
(—32)? = (—3)72? = 9 x (—1) = -9, 
it follows that —3i is also a square root of —9. 
Solution to Activity 8 
(a) 4—2i 
(b) —3 + 8 
(e) - 
(d) 5 


Solutions to activities 


Solution to Activity 9 


As z = a + bi, it follows that w = Z =a — bi. 
Therefore 

W=a-bi=at+h=z. 
Solution to Activity 10 
Let z = a + bi. 
(a) z +Z = (a+ bi) + (a — bi) = 2a = 2 Re(z) 
(b) z -Z = (a + bi) — (a — bi) = 2bi = 2i Im(z) 
Solution to Activity 11 
(a) (2 +3i)(2 — 3i) = 2 +3°=4+9=13 


(b) (—1 = 2i)(—1 + 2i) = (1)? + (2)? =1+4=5 
(c) (5i) x (—5i) = 5? = 25 
(d) (-2) x (-2) =4 
Solution to Activity 12 
— —2 — 3i 
(8) F3 > Caysn2— 30 
= —2—3i 
~ P+ 
_ —2— 3i 
13 
2i _2i(1—4) 
Ura G+H- i) 
_ 22i? 
~ PI 
242% 
7 2 
=1+i 


(c) To simplify this fraction, you could multiply the 
top and bottom of the fraction by the complex 
conjugate of the denominator, namely —2i. 
However, it’s simpler to multiply the top and 
bottom of the fraction by —i, as follows. 

11—8i (11 — B8i)(—i) 
2i (~i) 
_ —lli + 8i? 

= 2%? 

—1lli-8 
2 

—8— 11: 

2 
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® Sa. Solution to Activity 14 
i 4x (=i) 1 (The lines in the diagrams below are included to 
1 i i help you see how the answers are obtained. For 
(e) = axi lIl example, in part (a) the parallelogram law is used 
(E A+ Ti _ (4 + 7i)(—1 — 2%) to ae z+w. Your solutions needn’t include such 
i+ (121-2) lines.) 
_ —4-8i— Ti — 14i? (a) 
CIHR 
10- 15i f ztw 
"e 7 
=2-—31 
(e) a - _ (8+3) - 3i) ? 
+3i (14 32)(1—- 3i) > 
_ 8— 241 + 31 — 977 
=— Es 
_ 17-21% 
© 10 
m 2i _ (—2 + 5i)(—4 + i) 
424 (4 i(-4Fi) (b) 
8 24 — 208 + 5i? 
eae) A 
B2 
17 
Solution to Activity 13 upra 22 
: > 
A, @ 
43 + 24 
° 
ZOA 
Lie $ (c) 
A 
i2 — 4t 
z 


XI 
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Solutions to activities 


(d) (c) [Oo] =0 
(a) |— 31) =31 
(e) |17i| = V172 = 17 


(f) |r- ivI| = 72+ (-vis) 


(g) 1 +i) = VP +12? = v2 
(h) [19 + 192] = V19? + 19? = V/192 x 2 = 192 


(i) |-i = yC =1 


Solution to Activity 16 
Since z = a + bi, it follows that 


=g === bi 
and 
(£) Z=a-— bi. 
Therefore |z|, |—z| and |Z| are all equal to Va? + b?, 
z so they are all equal. 
wi Solution to Activity 17 
1 ET 2i ae 
Zz Ge a = 
2+i |2+i?2 2+1? 5 
| 1 =E 
b = 
b) 3 |—1— 33/2 
ace -1 +3i 
z ape ee 
C+ (3 
ii 
~ 10 
Solution to Activity 15 (c) You could use the formula 
(a) [3 +4iļ = VB +P = VIF T6 = V5 =5 z= 
z k 
(b) |- 4+ 32] = y (4) + 3? to find 1/2i, but it’s easier to multiply the top 
= V164+9 and bottom of the fraction 1/2i by i to give 
= v25 oe SOE 
-5 2i 2A i AO 2X 
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Solution to Activity 18 
(a) 


NIA 


From the diagram, Arg(7i/2) = 7/2. 


(b) 


—4i 


From the diagram, Arg(—4i) = —7/2. 


(c) 


From the diagram, Arg(—3) = 7. 
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From the diagram, Arg(2) = 0. 


Solution to Activity 19 
(a) 


From the diagram, 
1 
tan@d =- = 1. 
1 
So the principal argument is 0 = 7/4. 


Alternatively, you may see immediately from 
the diagram that the principal argument is 7/4, 
because it’s half a right angle (7/2 radians). 


From the diagram, 
3 
tang = ue = V3. 
Therefore ¢ = 7/3. So the principal argument is 


T 


(c) 


From the diagram, 


1 
tan ọ = —=. 
a 
Therefore ¢ = 7/6. So the principal argument is 
T OT 
(n-$)=-(r- 2) =- 


DCE 
z = 2/3 — 2i 


Solutions to activities 


From the diagram, 


oe 2 1 
an Q = —= = —. 
2v3 v3 
Therefore ¢ = 7/6. So the principal argument is 
T 
0 = —-ọ = —-. 
i 6 


Solution to Activity 20 
(a) 3(cos0 + isin0) = 3(1 +0) = 3 


(b) 7 (cos (5) + ésin (5)) =7(0 +i) = 7 
(c) 6(cosr + isin r) = 6(—1 + 0) = —6 
(d) cos (-5) +isin (-<) =0-i=-i 
(e) 5(cos(—7) + isin(—7)) = 5(—1 + 0) = —5 
TA |. /T 1 v3 
O a(o (F) tisa (F)) =a (3) 
=2+2V3i 
(g) 2 (cos (-F) +isin (-7)) =2 (= : =) 
= V2 — iv2 


3 
2 2 
Solution to Activity 21 
(a) The modulus is 7/2. From Activity 18(a), the 
principal argument is 7/2. Therefore 


Gs of T se 
i= 5 (008 (5) + (3)) 
(b) The modulus is 4. From Activity 18(b), the 
principal argument is —7/2. Therefore 


—4i = 4 (cos (-7) +isin (-5)) i 
(c) The modulus is 3. From Activity 18(c), the 
principal argument is m. Therefore 
—3 = 3(cos m + isin 7). 
(d) The modulus is 2. From Activity 18(d), the 
principal argument is 0. Therefore 
2 = 2(cos0 + isin 0). 
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(e) 


The modulus is 


r= y1? +12 = VI FI = V2. 
From Activity 19(a), the principal argument is 
m/4. Therefore 


1+i= v2 (cos (7) + isin (Z)). 


The modulus is 


r= y 12 + (-v3)?? = VI F3 = V4 =2. 


From Activity 19(b), the principal argument is 
—r/3. Therefore 


1—iv3=2 (cos (-5) +isin (-5)) ; 
The modulus is 


r= y (-v3} + (-1} = V3F I= v4 =2. 


From Activity 19(c), the principal argument is 
—5r/6. Therefore 


-V3-i=2 (cos (-=) + isin (-=)) . 


The modulus is 


r= 4/ (v3)? + (-2)2 = V21 = V6 =4 
From Activity 19(d), the principal argument 
is —7/6. Therefore 


2/73 -—2i=4 (cos (-2) +isin (-Z)) : 


Solution to Activity 22 


(a) 


(b) 
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i0)) 


zw = 48 (cos (Z + T) | isin (2 + 
7 5 ' 10] — 5 


zw = 40 (cos (= + =) 
8 4 
= 40 (cos (=) +isin (=)) 
8 8 


The angle 137/8 lies outside the interval 
(—7, 7], so it isn’t the principal argument of zw. 
The principal argument is given by 


137 On = 137 16r O 3T 
s 72g B83 g 
Therefore 


3T 
zw = 40 (cos (-3) 


(c) ny pem R 
c Zw = Cos 9 i 9 9 


167 isi 16r 
= cos | —— isin | — 
co 9 9 


The angle 167/9 lies outside the interval 
(—7, 7], so it isn’t the principal argument of zw. 
The principal argument is given by 


167 16r 187 27 
m= = . 
9 9 9 9 
Therefore 


2T 4 isi 27 
= cos | —— isin | —— |. 
ZW = CO 9 9 


Solution to Activity 23 
The product wuw equals 


elec e a 
a Oe treg 


a i 5) +ism (2-4) 
aG) 


(b) 


Elx 


— 
O 
~ 


= cos 0 + ¿sin 0 


In this case, the Cartesian form of z/w is 1 (as 
you would expect since z = w), which is simpler 
than the polar form. 


Solutions to activities 


Solution to Activity 25 Hence 


(140° = (VÐ? (cos (3 x 7) + isin (3 x 7) 


A 


v2 
T . 
ao 0) (2 (o E) G” 
= 212/3 (cos (=) +7sin (=)) 

From the diagram, Ar (Ar 

—i = cos (-3) +7sin (-3) ‘ Ti (cos (=) a (=)) 
Let z = r(cos 0 + isin 0). Then ak (3-2) 

z x (—i) 7 2 2 

= r(cos 0 + isin 8) x (cos (-3) +isin (-$)) = —8 — 8V3i. 

5 (cos (0- ) Feda (0- 7 l (c) The modulus of —1 + V3 is 


(—1)? + (V3)? = V1 F3 = V4 =2. 


The principal argument of —1 + i/3 was found 
to be 27/3 in Example 5 . Therefore 


-1+iv3=2 (cos (=) + isin (=)). 


Hence 


(—1 + iv3)’ 


2 2 
=2" (cos (7 x =) + isin (z x zy) 
— 128 14r me 14r 
= cos 3 isin 3 . 


Therefore multiplying z by —i corresponds to a 


5i 
clockwise rotation through a quarter turn ery ae on 
(—7/2 radians). 3.7 4r + 3° 
Solution to Activity 26 coe = 
. T es . o 
(a) The modulus of 1 +7 is (—1+iv3)" = 128 (cos (=) vr (=)) 
VP2+2=V1Fl=v2. _ oa (2 8 
The principal argument of 1 +i was found to 7 9 X oe 
be 7/4 in Activity 19(a). Therefore 
l ae YT = 64(—1 + iv3). 
1+i= V2 (cos (7) +7sin 6) ; 
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(d) The modulus of V3 + i is 


(V3)? +1? = V34+1=Vv4=2. 


From the diagram, 


1 
tan@ = —=. 


v3 


Therefore the principal argument is 0 = 7/6. So 


V3+i=2 (cos (=) +7sin (=)) : 
Therefore 


(V3 + 1)~6 
—2-6 (cos (-6 x =) +isin (—6 x =)) 


= = (cos(—7) + isin(—r)) 


64° 
(e) The modulus of 2 + 2i is 


VEFE =VIFi= VE = 23. 


2+ 20 


ALA 
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From the diagram, the principal argument is 
m/4. Therefore 
; T ce (0 
2+2i = 2V2 (cos (=) +7sin 6) : 
Hence 


(2+ 2i)" 


= (2V2) (cos (-=) + isin (-=)) 


Since 


May 3T 
= T = — 
4 4’ 


we obtain 
(2+2) 


Solution to Activity 28 
(a) Completing the square on the left-hand side 


gives 

(z-1)} +1=0; 
that is, 

(z=1) ==], 


Taking square roots of both sides gives 


z=] =F, so z=1+21. 
(b) Completing the square on the left-hand side 
gives 
(z+ 2)?+9=0; 
that is, 
(z+ 2)? =-9. 
Taking square roots of both sides gives 
z+2=+31, so z= —?2 +3i. 


(c) Subtracting 25 from both sides gives 
z2 = —25. 
Taking square roots of both sides gives 


z= +51. 


Solution to Activity 29 


(a) ya Det Ve -4x1x2 
E 2x1 
_ ~2avV—4 
7 2 
— -242i 
~ 9 
=-l+i 


(b) You could solve this quadratic equation by 
using the quadratic formula, but it’s easier to 
solve it by factorising, as follows: 

z2? +6z+9=0 
(z+3)? =0 
z+3=0 
z=-3. 
(c) You could solve this quadratic equation by 


using the quadratic formula, but it’s easier to 
solve it by using a simple rearrangement. 


Subtract 5 from both sides of the equation to 


give 
32? = —5; thatis 2° = -2. 
Therefore 
z i z v5 
= +i4/| = = ti—. 
3 v3 
Rationalising the denominator gives 
„u5, y3 y5 
AR fs 
4+,/(—4)?-4x1x8 
i. = om.) 
2x1 
_4+V/-16 
E 2 
44i 
= 2 
=2+21 


(e) You could solve this quadratic equation by 
using the quadratic formula, but it’s easier to 
solve it by factorising, as follows: 


27 +32=0 
z(z+3) =0 
Z:=() OF 


z=0 or 


Solutions to activities 


3+ /(-3)? -4x2x5 
2 2x2 
_ 34+ /-31 
E 4 
_ 3+ivy31 
4 


Solution to Activity 30 


(a) A suitable quadratic 


equation is 


(z — (1 + 2i))(z — (1 — 2i)) = 0. 
Simplifying gives 
((z — 1) — 2i)((z— 1) + 27) =0 
(z —1)? — (21)? =0 
z2? —2z+1+4=0 
z —2z+5=0. 
(b) A suitable quadratic equation is 
(z — (—3 + 47))(z — (—3 — 4i)) = 0. 
Simplifying gives 
((z + 8) — 4i)((z + 3) + 4i) =0 
( +3)? — (41)? =0 
z? +6z+9+16=0 
z2? +6z +25 = 0. 
(c) A suitable quadratic equation is 
(z — Ti)(z — (—7i))= 0 
Simplifying gives 
-— (Ti)? = 
z? +49=0. 


(d) A suitable quadratic 


equation is 


(z— (1+ 34))(z 
Simplifying ke 

((z— 1) — 34)((z 

(2-1)? — (34)? = 


(1—5%)) =0. 


Sees 
0 


2 — 2z +3 = 
4z? — 8z +5=0. 
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Solution to Activity 32 
(-1 — iva) =14 0542 (v3), 


= 1 +2iv3 -3 
= —2 + 2i V3. 
Therefore 


Cai = (1-08) (1) 
z (-2 + 2iv3) (-1 ale 


2 
= 2 + 21/3 — 21/3 — 21? (v3) 
=2+6 
= 8. 

Hence 
3 ; 3 
-1 —ivV3\  (=1-iv3) 8 4 
2 E 23 8o 

(There’s another, quicker solution to this activity. 
Let w = 4 (—1 +iv3), so that W = 4 (—1 — iv3 ). 
You’ve already seen that w is a cube root of unity, 
and now you’re asked to show that W is also a cube 
root of unity. To do this, take the complex 
conjugate of each side of the equation 


w= 1 
to give 

w=]. 
Since 


w = wxxw = Ux Tx T= T, 
it follows that 
w? = 1.) 
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Solution to Activity 33 
(a) Let z = r(cos 0 + isin 8). Then 
r?(cos 30 + isin 30) = cos 0 + isin 0. 


Comparing moduli gives r? = 1, so r = 1. 


Comparing arguments gives 
30 = 0 + 2mr = 2mrz, 


Hence 


9— a 
3 


Taking m = 0,1,2 gives the solutions 


where m is an integer. 


zo =cos0+isin0=1 


2 2 1 3 
Z1 = COS (=) +isin (=) = —— 443 


ae Ar rer An 1 : 
Z = — in | — | = -= — i—. 
a ge 3 2 9 


where m is an integer. 


V3 


All other values of m give repetitions of these 


three solutions. 


(b) Let z = r(cos 0 + isin 8). Then 
r*(cos 4 + isin 40) = cos 0 + isin 0. 
Comparing moduli gives r4 = 1, so r = 1. 


Comparing arguments gives 


40 = 0 + 2mr = 2mr, where m is an integer. 


Hence 


MT . : 
d= Ji where m is an integer. 


Taking m = 0, 1,2,3 gives the solutions 
zo = cos 0 + isin 0 = 1 


= oos (5) +ésin (3) =! 
zı = cos 5 isin 5 =i 


Zə = cos Tt + isin mt = —1 


_ 37 4 isi 37 oah 
Z3 = COS T isin a =-i. 


All other values of m give repetitions of these 


four solutions. 


Z1 
Z2 (ale 
TE Zo 
23 


Solutions to activities 


(c) Let z = r(cos 0 + isin 8). Then 
r°(cos 69 + isin 68) = cos 0 + isin 0. 
Comparing moduli gives rê = 1, so r = 1. 


Comparing arguments gives 


60 = 0 + 2mm = 2m7, where m is an integer. 


Hence 


MT . i 
0 = —, where m is an integer. 


3 


Taking m = 0,1, 2,3,4,5 gives the solutions 


zo = cos 0 + isin0 = 1 


Í 3 
Z1 = COS (5) +isin (5 = T 


2 2 1 3 
Z2 = COS (F) + isin (F) = -7 y 


z3 = cos T + isin m = —1 


_ Ar Jia AT 1 3 
Z4 = COS 3 isin 3 5 5 


5r _, (ia 1 V3 
z5 = cos | — |] +2zsin| — |] = = — 1—. 
3 3 2 2 


All other values of m give repetitions of these 


six solutions. 
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Solution to Activity 34 
(a) A polar form of the complex number 64 
is 64(cos0 + isin0). Let z = r(cos6 + isin 0). 
Then 
r°(cos 66 + isin 60) = 64(cos0 + isin 0). 
Comparing moduli gives rê = 64, so r = 2. 
Comparing arguments gives 
60 =0+2mz7, where m is an integer. 
Hence 


MT i à 
d= EL where m is an integer. 


Taking m = 0,1, 2,3,4,5 gives the solutions 
zo = 2(cos0 + isin 0) = 2 


Zi =2 (cos (=) +7sin (=) =1+iv3 


25 Ar a Ar 

z4 = 2 | cos 3 isin 5 
5 5 

= 9 (cos (=) eT (=)) EE e 


All other values of m give repetitions of these 
six solutions. 
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(b) A polar form of the complex number —8 


is 8(cos7 + isin r). Let z = r(cos@+ isin 0). 
Then 


r?(cos 36 + isin 30) = 8(cosm + isin 7). 
Comparing moduli gives r? = 8, so r = 2. 
Comparing arguments gives 

30 = T + 2m7 = (2m + 1)r, 
where m is an integer. 


Hence 


where m is an integer. 
Taking m = 0, 1,2 gives the solutions 
T .. [T : 
262. (cos (=) + isin (=) =1+iv3 
zı = 2(cosa+isinaz) = —2 


22 = 2 (cos (=) + isin (=)) =1-iv3. 


All other values of m give repetitions of these 
three solutions. 


(c) The modulus of —1 — i is 


(—1)? + (-1)? = VI FI = v2. 


om 


ae 


71 


= 


From the diagram, 
OT 
Arg(—1 — i) = —. 
rg( q=] 
Therefore 


-1 =i = V3 (cos (=) + isin (=) l 


Let z = r(cos 0 + isin 0). Then 
r° (cos 50 + isin 50) 


=(= (5) s ($). 


Comparing moduli gives r5 = v2, so r = 21/10, 


Comparing arguments gives 
50 = 5r/4+ 2m7, where m is an integer. 


Hence 
9=— T P 2MT 
4 5”? 
Taking m = 0, 1,2,3,4 gives the solutions 


where m is an integer. 


Zo = 21/10 (cos (7) + isin 6) 
137 137 
_ 91/10 SINY. tagia foe 
zZ 2 (cos ($7) isin (7) ) 
21 21 
i 21/10 | cos all ere isin = 
20 20 
297 297 
_ 91/10 BON: ie el of ee 
zZ 2 (cos (2) isin ($7) ) 
37 
4 21/19 | cog li + isin a : 
20 20 
All other values of m give repetitions of these 


five solutions. 


Solutions to activities 


Solution to Activity 36 
(a) 3e® = 3(cos0 + isin 0) 
= 3(1+ 0) 
=3 
, T T 
(b) 7e'™/? =7 (cos (5) + isin (5)) 
=7(0 +i) 
=i 
(c) 6e = 6(cos m + isin r) 
= 6(—1 +0) 
=—6 
(d) e~*"/? = cos(—7/2) + isin(—7/2) 
=). =4 
Z 
(e) 5e-** = 5(cos(—m) + isin(—7)) 
=5(14-0) 


(£) 4e"/? = 4 (cos (=) + isin (=)) 
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5ri/6 om „on [3T 
(h) V3e = V3 | cos T +isin T 


3 3 
2 2 


Solution to Activity 37 


(a) From Activity 21(a), the modulus is 7/2 and 
the principal argument is 7/2. Therefore 
Too T 
= E 
or 
(b) From Activity 21(b), the modulus is 4 and the 
principal argument is —7/2. Therefore 
—4i = 4e7 7/2. 
(c) From Activity 21(c), the modulus is 3 and the 
principal argument is 7. Therefore 
—3 = 3e". 
From Activity 21(d), the modulus is 2 and the 
principal argument is 0. Therefore 
2 = 2e”. 
(e) From Activity 21(e), the modulus is v2 and the 
principal argument is 7/4. Therefore 
1+i=V2e'"/4. 
(f) From Activity 21(f), the modulus is 2 and the 
principal argument is —7/3. Therefore 
1 —iv3 = 2e7 77/9. 
(g) From Activity 21(g), the modulus is 2 and the 
principal argument is —57/6. Therefore 
-V3 — i = 26-576, 
From Activity 21(h), the modulus is 4 and the 
principal argument is —7/6. Therefore 


2V3 — 2i = 4e*"/6, 


in/2 
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Solution to Activity 38 


Because 
r(cos@+isin@) = ret? 

s(cos¢ + isind) = se’? 

~(cos(6 — ¢) + isin(0 — ¢)) = AV, 


the formula for the quotient of two complex 
numbers in polar form can be written as 
i0 
re L T ilO-e), 
se? s 


Solution to Activity 39 

We use the formulas in the boxes above the activity, 
and the formula from Activity 38. That is, we use 
the usual index laws. 

(a) et7/8 x e377 /8 = ei (t/8+31/8) 


— et(47/8) 
— eit /2 
(b) (e — pi(4xn/8) — pin/2 
3ri/8 
e i — et(37T/8-7/8) 
en 
= et(27/8) 
a etT/4 
in/8 
e i(n/8—3r 
(d) ae /8—31/8) 
— et( 727/8) 
= eit/4 


Solution to Activity 40 
Euler’s formula gives 

e942") — cos(0 + 2m) + isin(0 + 27). 
Since 

cos(0 + 27) =cos@ and 
it follows that 

(+2) — cos 9 + isind = e”. 


sin(@ + 27) = sin 0, 


Solutions to activities 


Solution to Activity 41 Solution to Activity 42 
For brevity, write c = cos@ and s = sin 0. Then, by (a) By the given formula for sin@ and the binomial 
de Moivre’s formula and the binomial theorem, theorem, 
ote = maa 4 1 
cos 46 + isin 40 = (cos 0 T sin 0) E T ae ( i0 gu 
= (c + is) (2i) 
= į +4 (is) + 6c? (is)? — = (Ce)? + 3(€)2(—-e- #9) 
—8i 


+ Ac(is)® + (is)4 


10; ,-i0)2 | (103 
= cf + dic’s — 6s? — dics? + st + 3e"(—e *’)* + (-e )’) 


= (c* — 6c*s? + st) + 4ics(c? — 5”). = (e? See E geo e 3°) 
Comparing real and imaginary parts, and using the 1. 2 bib - gp 
identity c? + s? = 1, gives =- (e de” + 3e or) 
cos 40 = c* — 6c?s? + st 1 ; i i 3i 
4 2 2 2 2 = = (3(e eae ae ”)) 
= c —6c*(1-—c*)+(1-c*)(1-c*) 8i 
i0 _ „—ið 310 _ „—3i0 
=é 6e 458 ie a at Py cies eae 
= &c* — 82 +1 4 2i 2i 
1 
= 8cos* 0 — 8 cos’ 0 + 1 = 7 (Bsind — sin 38). 
and 


os (b) By the given formula for cos @ and the binomial 
sin 40 = 4sc(c* — s*) theorem, 


| = Asin as A(cos” 0- sin? 0), cost 0 = = (e 4 ew ivy! 
which are the required identities. 2 
(Another way to obtain the identities in this ie ((e%*)* Ata Pe Glet (eo)? 
activity is to apply the double-angle identities 16 sip ated ae 
sin2@=2sin@cos@ and  cos20 = cos? 0 — sin? 0 i PAE T AT] ) 
repeatedly. For example, =e (e? + 4e eT? t Ge? eT? 
sin 40 = sin(2(26)) 4 hei e73 y ga) 
= 2sin 26 cos 20 1 5 
= 4i0 2i —2ið | „—4ið 
= 2(2 sin 0 cos 0) (cos? 0 — sin? 0) za + Ee" +6 kde ma] 
— 4s 29 an2 1 , , , 
4sin 0 cos 6(cos* @ — sin? @).) =- ((e® + e749) 4 4(€29 4 e28) 46) 
1 /eti? 4 e740 e219 4 e7210 
=p) ee | ae a |43 
oe 
1 
= (cos 40 + 4 cos 20 + 3). 
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